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(54) NOVEL SUGAR-CHAIN SYNTHETASE AND PROCESS FOR PRODUCING THE SAME 

(57) Novel GalNAca2 f 6-siaIyltransferases P-B1 and 
P-B3; GallMAca2,6-sia]yltransferase genes encoding 
the above GaINAca2,6-sia)yitransferases P-B1 and P- - 
B3; and an extracellularty releasable protein catalyzing 
GalNAca2,6-sia!ic acid transfer which comprises a 
polypeptide portion as being an active domain of the 
GalNAca2,6-sialyttransferase P-B1 or P-B3 together 
with a signal peptide are provided. Also provided is a 
process for preparing a si aly {transferases which ena- 
bles efficient recovery of a siatyitransferase expressed 
in a large quantity in microorganisms. 



o 

Q_ 
LU 



Prtmadbv Rank Xarat ODO ftirsfrfra SeMoas 



EP0737745A1 



Description 

Technical Field 

5 The present invention relates to a sugar-chain synthetase and a DMA encoding the enzyme. More specifically, the 
present invention relates to an N-acety1galactosaminea2 I 6-slalyltransf erase (GalNAca2,6-sialyltransferase) and a DNA 
encoding the enzyme. The enzyme is useful as medicaments having inhibitory activities against tumor metastases and 
viral infection, and as agents for introducing a sialic acid moieties into drugs to increase their biological activity. 

The present invention further relates to a process for producing the sugar-chain synthetase. More specifically, the 

w present invention relates to a process for expressing sialyttransferases in microorganisms to obtain the sialyttrans- 
ferases in large quantities. 

Background Art 

is Sialic acids play an important rote in a variety of biological processes, like cell-cell communication, cell-substrate 
interaction, adhesion. It has been known that various kinds of distinguishable cell surface sialic adds exist which 
change in a regulated manner during development, differentiation, and oncogenic transformation. 

Sialic acids occur at the terminal positions of the carbohydrate groups of glycoproteins and glycoliptds, and they 
are enzymatically introduced from CMP-Sia to these positions in a post translational process. For example, three link- 

20 age patterns, Saa2,6Gal, Siaa2,3Gal and Siaa2,6GaiNAc are commonly found in glycoproteins (Hakomori, S., Ann. 
Rev. Biochem, 50, pp.733-764, 1981), and two, Siaa2,3Gal and Siaa2,8Sia, occur frequently in gangliosides (Fish- 
man, P., and Brady, R.O.. Science, 194, pp. 906-91 5, 1976). 

The enzymes responsible for such enzymatic introduction of sialic acid (sialic acid transfer) as mentioned above 
are glycosyltransf erases called sialyttransferases. It has been known that at least 12 cBfferent sialyltransferases are 

25 required to synthesize all known sialyloligosaccharide structures (Broquet, P. etal., Int J. Biochem., 23, 385-389, 1991 ; 
and Weinstein, J. et a!., J. Biol. Chem., 262, 17735-17743, 1987). Among these enzymes, five sialyltransferases have 
been purified so far, and it has been known that they exhfcit strict specificity for acceptor substrate (Sadler, J. et al., J. 
Bio. Chem. t 254, pp.4434-4443. 1979; Weinstein, J. et al., J. Biol. Chem., 257, pp.13835-13844, 1982; Rearick, J. et 
al., J. Biol. Chem., 254, pp.4444-4451, 1979; and Joqiasse, D.H. et al.. J. Biol. Chem., 260. 4941-4951. 1985). 

30 As for cDNAs encoding the aforementioned sialyltransferases, cDNAs encoding Gaipi ,4GlcNAca2, 6-siaJyl trans- 
ferase (Gaip4GlcNAc-a6ST) have been cloned from various organs including liver (Weinstein, J. et al., J. Biol. Chem., 
262, pp.1 7735-17743, 1987; Grundmann U. etal., Nucleic Acids Res. 18, 667, 1990; Bast, B. et al., J. Cell. Biol., 116, 
pp.423-435, 1992; and Hamamoto, T. et al., Bioorg. and Medic. Chem., 1. pp.141-145, 1993). Furthermore. cDNAs 
encoding Gaipi,3GalNAca2,3-sialyltransferase (Gaip3GalNAc-a3ST) (Gillespie, W et al., J. Bid. Chem., 267. 

3$ pp.21004-21010, 1992: Japanese Patent Unexamined Publication No. 5-504678/1993; and Lee, Y. et al., Eur. J. Bio- 
chem, 216, 377-385. 1 993); Galpl ,3(4) GlcNAca2,3-sialyItransf erase (Gaip3(4)GIcNAc-a3ST) (Wen, D. X et al., J. Biol. 
Chem.. 267, 21011-21019,1992; and Wtagawa, H. et al., Biochem. Biophys. Res. Commun. 194, 375); and 
Gaipi.3GalNAc/Galp1 ,4GIcNAca2,3-sialyftransferase (Sasaki, K. et al., J. Biol. Chem., 268. 22782-22787, 1993) have 
also been cloned. 

40 With respect to GalNAca2,6-sialyltransferase. the isolation of this enzyme has been reported (Hakomori, S., Ann. 
Rev. Biochem., 50. 733-764, 1981). However, the enzyme has not been purified so as to be characterized as a single 
identifiable substance, and accordingly, the enzyme has not been practically used because of insuff ident reaction spe- 
cificity, stability, and quantitative availability. Furthermore, a cDMA sequence encoding GalNAca2,6-sialyftransferase 
(EC 2.4.99.3; GaJNAc-aSST) has not yet been cloned. 

45 Each of the aforementioned sialyttransferases whose structures having been revealed has a hydrophobic segment 
located at the NH r terminal region, and is a type II transmembrane protein immobilized to cell membrane by the hydro- 
phobic segment From this reason, a problem arises that expressed enzymes are immobilized to ceil membranes and 
are not capable of being extracellularly released, where expressions are carried out using vectors containing sialyl- 
transferase genes that are transfected into mammalian cells. Furthermore, another problem may arise, when the 

so expression is performed using mammalian cells, that enzyme expressions may be reduced as endoplasmic enzyme 
concentrations exceed certain levels. 

In order to solve the above problems, an extracellularly releasable fused protein may be prepared which comprises 
an active domain of a sialyttransferase and a signal peptide region. This method is characterized in that a siafyltrans- 
ferase can be readily recovered from a cell cultivation mixture, because the method involves the step of extracellular 

55 release of the fused protein which retains sialyl transfer activity and function as a sialyttransferase. However, where the 
expression of a sialyttransferase is performed using a mammalian cell, a transfected ceO may be unstable or trouble- 
some cultivation procedures are required. In addition, in order to obtain a large quantity of expressed sialyttransferase, 
a mass cell culture is essential for a long period of time, which may cause disadvantageous from viewpoints of cost and 
manufactural installations. 
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Processes are well known to those skilled in the art to obtain cloned cDNA sequence encoding an enzyme 
expressed in mammalian cells and prepare a recombinant vector containing a gene encoding the enzyme, per se, or in 
a soluble form, and to transform microorganisms with the vector. A desired enzyme can be produced, in a large quantity, 
by culturing the transfbrmant obtained by the aforementioned method to allow the microorganism expresses the 

5 enzyme, per se, or in a soluble form that has the desired activity. 

This process comprises, for example, a step of culturing a transformed microorganism and extracting an expressed 
enzyme by lysis of the microorganisms using lysozyme or the like. However, a large amount of insoluble or soluble pro- 
teins is expressed in the microorganisms in a short period of time, and such proteins may aggregate inside the micro- 
organisms to form proteinic aggregates or precipitates. Accordingly, it is necessary to extract the protein from such 

10 aggregates or precipitates. 

To extract the desired protein from the aforementioned aggregates or precipitates, generally employed methods are 
those using urea, guanidine hydrochloride and the like. In this approach, the expressed protein is generally subjected 
to denaturation using, for example, urea for solubilization (by an exposure of the hydrophobic region), and then to rena- 
turaton treatment The renatu ration may be achieved by removing the urea through dialysis. However, for the removal 

is of urea, a problem Is that optimal conditions including pH, salt concentration, and temperature must be chosen that are 
strictly specif ic to each of the enzymes, and this optimization of conditions is extremely time-consuming. If inappropriate 
conditions are applied, recovered enzyme may retain almost no activity, and therefore, the selection of the conditions 
for the renaturation is particularly important 

Accordingly, one object of the present invention is to provide purified GalNAco2,6-sialy!transferase. Another object 

20 of the present invention is to provide a DNA sequence encoding GalNAca2,6-sialyltransferase and an amino acid 
sequence of the enzyme by cloning a cDNA sequence that encodes GalNAca2,6-sia!yltransferase. Further objects of 
the present invention are to provide an extracellularly releasable protein comprising an active domain of the 
Gal NAco2,6-sialyltransf erase and to provide a process for a mass expression of said protein in microorganisms: K is 
also an object of the invention to provide a process for extraction of an expressed sialyltransferase from aggregate 

25 thereof in microorganisms and a process of efficient renatuation of the extract. 

SUMMARY OF THE INVENTION 

The present inventors conducted various studies to achieve the foregoing objects, and as a result, they succeeded 

so in cloning the cDNA encoding GalNAca2,6-sialyltransferase from chick embryo. The present invention was achieved on 
the basis of these findings. The present invention thus provides GaINAca2,6-sialyttransferase P-B1 characterized by 
the amino acid sequence disclosed as SEQ ID NO.1 in the sequence listings. The present invention also provides 
Gal NAca2,6-sialyltransf erase genes encoding the aforementioned amino acid sequence of GalNAca2,6-siaiyitrans- 
ferase P-B1, and as an embodiment thereof, a GalNAca2,6-siaiyttransferase gene characterized by the nucleotide 

$5 sequence from nucleotide Na1 to 1698 cfisdosed as SEQ ID NO.1 in the sequence listings. Also provided are recom- 
binant vectors comprising the above GalNAca2,6-sialyltransferase gene and plasmid XCEB-3034 as an embodiment 
thereof, transformants which are transformed with the above recombinant vector, and the active domain of GalNAca2,6- 
sialyltransferase characterized by the amino acids of No. 233 through 566 of the amino acid sequence disclosed as 
SEQ ID NO.1 in the sequence listings. 

40 The GaJNAco2,6-sialyl transferase P-B1 has activity of transferring sialic add to the 6-position of N -acetylgalactos- 
amine directly bound to a protein regardless of the presence or absence of a substituent on hydroxy! group at the 3- 
position. The structure of NeuAca2,6GaiNAc-protein is thus readily formed by the enzyme, which terminates further 
extension of the resulting sugar chain. Therefore, where a longer sugar chain is desired, a sugar chain synthetic 
scheme should be designed so that this enzyme can be employed after complete extension of a sugar chain. For this 

4$ reason, a sialyltransferase is highly useful which fails to transfer sialic acid to an N-acetyfgalactosanvne that has unsub- 
stituted 3-hydroxyl group and bonded to a protein via an a-glycoside linkage, but can transfer sialic acid to the 6-position 
of an ^acetylgalactosamine bound to a protein via an a-glycoside linkage, only when the hydroxy! group at 3-position 
is substituted with a galactose or a sugar chain having a galactose at its reduced terminus. 

Therefore, the inventors of the present invention cloned a cDNA from chicken testes that encodes Gal NAca2,6-sia- 

so lyltransferase having the aforementioned features, and as a result they achieved the present invention relating to the 
Gal NAca2,6-sia!yitransf erase P-B3 characterized by the amino acid sequence disclosed as SEQ ID NO.3 in the 
sequence listings. The present invention thus provides Ga!NAca2,6-sialyttransferase genes encoding the above amino 
acid sequence of the GaINAca2,6-sialyJtransferase P-B3, and as an embodiment thereof, the Ga!NAca2,6-sialyltrans- 
feras e gene having the nucleotide sequence of from nucleotide No.1 to 1212 as disclosed as the SEQ ID NO.3 in the 

55 sequence listings. The present invention also provides a recombinant vector comprising the above GalNAca2,6-sialyl- 
transf erase gene and plasmid XCEB3-T20 as an embodiment thereof, and a transformant being transformed with the 
above recombinant vector. 

The inventor of the present invention further conducted studies to provide an extracellularly releasable protein com- 
prising a portion, i.a active domain, that is derived from the structure of the aforementioned GalNAco2,6-sialyltrans- 
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ferase and is responsible for its activity. As a result, they succeeded in identifying a partial polypeptide of the above- 
described GalNAca2,6-siafyttransferase as being the active domain, and achieved the present invention directed to an 
extracellularly reieasable protein which comprises the polypeptide region together with a signal peptide and catalyzes 
GalNAca2,6-sialic acid transfer. As an embodiment thereof, protein SB-690 characterized by the amino acid sequence 

5 disclosed as SEQ ID NO.2 in the sequence listings. The present invention also provides genes encoding the above pro- 
tein, and as an embodiment thereof, a gene having the nucleotide sequence characterized by from nucleotide No. 1 to 
1065 cfisclosed as SEQ ID NO.2 of the sequence listings, and a recombinant vector containing the aforementioned 
gene and plasmid pcDSB-690 as an embodiment thereof. Further provided are a transformant being transformed with 
the above recombinant vector ami a process for preparing the aforementioned protein which comprises the steps of cul- 

70 turing the above transformant and recovering the above protein from the culture. 

In addition, the inventors found that a Gaipi,4GaJNAcd2,6-sialyttransferase with a highly restored activity can. be 
prepared by expressing mouse Gaipi ,4GalNAca2,6~sialyitransf erase in an insoluble form in Escherichia coli, followed 
by extracting the enzyme with urea and subjecting the enzyme to renaturation under optimal conditions, and thus 
achieved the present invention. In accordance with the present invention, there is provided a process for producing a 

is sialyltransferase which comprises the steps of: (a) expressing a sialyttransferase in a microorganism; (b) extracting the 
sialyttransferase with about 5 to 9 M urea from proteinic aggregates or precipitates accumulated inside the microorgan- 
ism and containing the enzyme; (c) diluting the extract obtained by the above step (b) with a renaturation composition 
to obtain a primary dilution containing about 1 to 4 M urea; (d) diluting the primary dilution obtained by the above step 
(c) with a renaturation composition to obtain a secondary dilution containing about 0.5 to 2 M urea; and (e) removing 

20 urea from the secondary dilution obtained by the above step (d) by dialysis to afford a renatured sialyttransferase. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a restriction map of the cDNA done encoding GalNAca2,6:Sialyltransfefase P-B1 . In the figure, E rep- 
25 resents EcoRI; RV: EcoRV; P: Pstl; and B: BgllL 

Fig. 2 shows the result of hydrophobicity analysis of the GaINAca2,6-sialyltransferase P-B1 according to the 
present invention. In the figure, N-terminus of the protein is depicted at the left side and positive values indicate hydro- 
phobic regions. 

Fig. 3 shows the location of the active domain of the GalNAca2,6-sialyttransfaase P-B1 according to the present 
so invention and the result of comparison with the structure of protein SB-690 which has Ga!NAca2,6-sialyftransf erase 
activity and can be extracellularly released. In the figure, protein SB-BGL is a protein not having GalNAca2,6-sialyl- 
transf erase activity. 

Fig. 4 shows the result of comparison between the primary sequences of GalNAca2,6-siaIyltransferase P-B3 and 
GalNAca2,6-sialyttransferase P-B1 according to the present invention. In the figure, amino acids are represented by the 
35 one-letter abbreviations. 

BEST MODE FOR CARRYING OUT THE INVENTION 

As the most preferred embodiments of the present GalNAc«2,6-sialy1transf erases, Gal NAca2,6-sialyrtransf erases 
40 P-B1 and P-B3 are provided. The explanations set out below will detail GalNAca2,6-sialyItransferases P-B1 and P-B3 
as examples of the enzyme of the present invention. However, the GalNAca2,6-siaIyttransf erases of the present inven- 
tion are not limited to the Ga!NAca2,6-sialyi transferase P-B1 or P-B3. GaINAca2,6-sialyttransf erases comprising the 
active domain of the Gal NAca2,6-sialyltr an sf erase P-B1 and/or that of P-B3, both were first revealed by the present 
invention, or alternatively, those comprising one or more active domains of the Ga)NAca2,6-siaIyttransf erase in which 
45 the aforementioned acid sequence is partially changed or modified also fall within the scope of the present invention. A 
preferred example of such active domains as mentioned above is the active domain of the Gal NAca2,6-siaiy1 trans- 
ferase characterized by from amino add No.233 to 566 of the amino add sequence disdosed as SEQ ID NO.1 of the 
sequence listings. 

The methods for isolation of the respective cDNAs encoding G^lNAca2,6-sialyhransferase P-B1 and GalNAca2,6- 
so sialyttransferase P-B3 will be detailed in Examples set out below. However, the methods for isolation of the cDNAs 
encoding GalNAco2,6-siaIyl transferase P-B1 and Gal NAco2 ,6-sialyttransf erase P-B3 are not limited to those methods. 
One of ordinarily skilled artisan can readily isolate the desired cDNAs by referring to the methods described in the fol- 
lowing examples, or alternatively, by appropriately modifying or altering those methods. In addition, the nucleotide 
sequences disdosed as SEQ ID Nos.1 through 3 in the sequence listings may be synthetically prepared and used to 
55 carry out the present invention. 

The DMA sequence encoding GalNAca2,6-sia)yttransferase P-B1 as defined by SEQ ID Na1 in the sequence list- 
ings and the DNA sequence encoding GalNAco2,6-sialyttransferase P-B3 as defined by SEQ ID No.3 are the preferred 
embodiments of the present invention. However, the DNA sequences encoding GalNAca2,6-sialyltransferase P-B1 or 
Gal NAca2,6-sialyttransf erase P-B3 of the present invention are not limited to those specified embodiments, and any 
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one of DNA sequences encoding the respective amino add sequences of GaINAca2,6-sialyltransferase P-B1 and 
GaINAca2,6-sialyttransferase P-B3 revealed by the present invention fall within the scope of the present invention. For 
example, the DNA sequence encoding the active domain of Gal NAca2,6-sialyttransf erase characterized by the amino 
acids of from Na 233 to 566 of the amino acid sequence as defined by SEQ ID Nat in the sequence listings is a pre- 
5 f erred embodiment of the present invention, in addition, the DNA characterized by the nucleotides sequence of from 
nucleotide Na 699 to 1698 of the SEQ ID No.1 shown in the sequence listings is a particularly preferred embedment 
of the present inventioa 

The GalNAca&6-sialyltransferases of the present invention, including P-B1 and P-B3 for example, may occasion- 
ally be retained inside the cells after expression and not released extracellular!/ Furthermore, when endoplasmic con- 

10 centrations of the enzymes exceed certain levels, expressed amounts of the enzymes may possibly be reduced. In 
order to efficiently utilize the aforementioned GaINAca2,6-sialic acid transfer activities of GalNAca2,6-sialyitransf erase 
P-B1 and P-B3, proteins in soluble forms may be prepared in which the activities of these enzymes are retained and 
can be released extracell ularly from cells upon their expressions. Examples of such proteins include, for example, extra- 
ceilularly releasable proteins which comprise a polypeptide, as being an active domain of the above-described 

is GalNAca2.6-sialyttransferase P-B1 or P-B3 and is responsible for the GalNAca2,6-sialyttransf erase activity, and cata- 
lyze the GalNAca2,6-sialic acid transfer. 

The sialyttransferases so far cloned have domain structures similar to those of other glycosyltransferases: a short 
NH2-terminal cytoplasmic tail; a hydrophobic signal -anchor domain; a proteolytically sensitive stem region; and a large 
COOH-terminal active domain (Paulson, J.C. and Colley, K.J., J. Biol. Chem., 264, 17615-17618, 1989). To determine 

20 the location of the transmembrane domain of the GalNAca2,6-sialyttransf erase P-B1 of the present invention, hydrop- 
athy plot may be used which can be prepared according to the method of Kyte and Doolittle (Kyte, J. and Doolhtle, R.F., 
J. Mol. Biol., 157, 105-132, 1982). To evaluate a putative active domain, recombinant plasmids introduced with various 
fragments may be produced and utilized. Exemplary methods will be detailed in the Examples set out below. However, 
the methods for determination of the location of the transmembrane domain or evaluation of a putative active domain 

2$ are not limited to the disclosed methods. 

For the preparation of the extracellularly releasable protein comprising a polypeptide portion, as being an active 
domain of the above-described GalNAca2,6-siaiy1 transferase P-B1 or P-B3. together with a signal peptide, an immu- 
noglobulin signal peptide sequence, for example, may be used as the signal peptide, and a sequence corresponding to 
the active domain of Ga!NAca2 t 6-sialyttransf erase P-B1 or P-B3 may be fused in-frame to the signal peptide. For exam- 

so pie, the method of Jotting et al. (Jobling, S.A. and Gehrke, L, Nature (Lond.), 325, 622-625, 1987) may be applied as 
such methods, whose specified procedure will be detailed in the Examples of the present specification with respect to. 
Ga)NAca2,6~sialyltransferase P-B1. However, types of the signal peptide and methods for ligation of the signal peptide 
and the active domain are not limited to the aforementioned methods, and a person skilled in the art can suitably 
choose the polypeptide portion as being an active domain of GalNAca2,6-sialyttransferase, preferably GalNAca2,6-sia- 

35 lyitransf erase P-B1 or P-B3 , and produce the extracellularly releasable protein by ligating the polypeptide portion to any 
available signal peptide according to an appropriate method. The most preferred example of these proteins is protein 
SB-690 of the present invention. 

According to another embodiment of the present invention, there is provided a process for producing a sialy [trans- 
ferase which comprises the steps of: (a) expressing a sialyl transferase in a microorganism; (b) extracting the sialyttrans- 

40 ferase with about 5 to 9 M urea from proteinic aggregate or precipitate containing the enzyme and being accumulated 
inside the microorganism; (c) diluting the extract obtained by the above step (b) with a renaturation composition to 
obtain a primary dilution containing about 1 to 4 M urea; (d) diluting the primary dilution obtained by the above step (c) 
with a renaturation composition to obtain a secondary dilution containing about 0.5 to 2 M urea; and (e) removing urea 
from the secondary dilution obtained by the above step (d) by dialysis to afford a renatured siafyitransferase. As 

45 described above, sialyltransf erases share the common domainal structure, and therefore, the preparation process of 
the present invention may be applicable to any type of siafyitransferase. For example, GalNAca2,6-sialyHransferase or 
Galpi ,4Ga3N Ac a2,6-sialyttransf erase of the present invention can be suitably prepared by the process of the present 
invention. 

. According to an embodiment of the process of the present invention, 8 M urea is used in the step (b); a primary 
so dilution containing about 2 to 3 M urea is obtained in the step (c); a secondary dilution containing about 1 to 2 M urea 
is obtained in the step (d); and the secondary dilution is dialyzed in the presence of divalent cations in the step (e). 
According to another embodiment of the present method. 8 M urea is used in the step (b); a primary dilution containing 
about 2 to 3 M urea is obtained by being left stand for 12 hours or more at 4 °C after primary dilution in the step (c); a 
secondary dilution containing about 1 to 2 M urea is obtained by being left stand for 48 hours or more after secondary 
55 dilution in the step (d); and the secondary dilution is dialyzed in the presence of divalent cations in the step (e). In addi- 
tion, it is also a preferred method in which the renaturation composition used in the step (c) contains 1 to 2 M urea, 20 
mM MOPS-NaOH, 0.5M NaCI, 20 mM lactose, 0.5 mM EDTA (pH 7.0) and the renaturation composition used in the 
step (d)contains 20 mM MOPS-NaOH, 0.5M Nad, 20 mM lactose, 0.5 mM EDTA (pH 7.0). 
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The first step of the process for the preparation of sialyttransferase according to the present invention is the expres- 
sion of a sialyttransferase in microorganisms. To this end, previously cloned genes of sialyttransferases can be used. 
As cDNAs encoding sialyttransferases, the cDNA encoding Gaipi ,4GlcNAca2,6-sialyttransf erase (Galp4GlcNAc-o6ST, 
see, Weinstein et al., Grundmann et a!., Bast et al. and Hamamoto et a)., supra), the cDNA encoding 

s Gaipi ,3(4)<3cNAca2,3-siaIyltransferase (Gaip3(4)GlcNAc-a3ST, see, Wen et at. and Kitagawa et at., supra), the cDNA 
encoding Gaipi»3GalNAc7Galp1,4GIcNAca2,3-sialyltransferase (see, Sasaki et al., supra), the cDNA encoding 
Gaipi.3GalNAca2,3-sialyltransferase (Gaip3GaiNAc-a3ST, see, Gillespie et al. ami Japanese Patent Unexamined 
Publication No 5-504678/1993; and Lee et al., supra), for example, may be used, as well as cDNAs encoding the 
Gal N Aca2,6-s alyttransferases of the present invention. Sialyttransferase genes contained in these nucleotide 

10 sequences, per se, may be used for the expression of the naturally-derived enzymes 

According to the present invention, in addition to the naturally-derived sialyttransferases mentioned above, non-nat- 
ural sialyttransferases in which the polypeptide sequences of the naturaJfy-derived sialyttransferases are partly deleted 
or modified may be expressed in microorganisms. For example, since sialyttransferases have a hydrophobic segment 
(transmembrane domain) in the Nhfe-terminal region, and sialyttransferases in soluble forms wherein the hydrophobic 

rs segment is deleted are preferably expressed in the microorganisms. In addition, deletion of both of the hydrophobic 
segment and the cytospl segment is also preferred. 

In order to produce recombinant vectors for the expression of sialyttransferases, the entire sequences or partial 
regions of the genes of naturally derived sialyttransferases may be selectively amplified by, for example, PCR method. 
For example, a sialyttransferase gene (a PCR fragment) may be readily prepared which has an initiation codon and a 

20 cloning site and lacks the cytosol domain and transmembrane domain. This type of sialyttransferase genes are suitably 
used for the introductions into vectors for microbial expressions due to the presence of the initiation codon and the clon- 
ing site. In addition, said genes are preferred since they encode non-natural sialyttransferases, in which a part of the 
polypeptide sequence of the naturally-derived sialyttransferase is deleted, and express non-natural soluble sialyttrans- 
ferase in microorganisms. 

25 According to the process of the present invention, microorganisms such as Escherichia cofi may be used for the 
expression of sialyttransferase. A microbial expression vector suitably used for transformation of such microorganisms 
may be suitably selected by an ordinarily skilled artisan. For example, where E. coli JM1 09(DE3) or the like is used as 
the microorganism, microbial expression vectors such as pET3b (Studier, F.W. et al., Method. Enzymol., 185, pp.60-89, 
1990) may be used. Methods for introducing the above described sialyttransferase genes into microbial expression vec- 

30 tors and methods for transforming microorganisms with recombinant vectors are both well known to those skilled in the 
art 

The transformants can be cultured according to methods for cutturing transformed microorganisms well known to 
those skilled in the art For efficient expression of a desired sialyttransferase in microbial cells, replication of the recom- 
binant protein can be initiated by, for example, the induction of T7-RNA polymerase during the logarithmic growth phase 
35 of the transformants. A large amount of naturally-derived or non-natural sialyttransferase is expressed inside the trans- 
formants thus obtained, which generally forms proteinic aggregate or precipitate. 

The second step of the process of the present invention is the extraction step of a sialyttransferase with 5 to 10 M 
urea from the proteinic aggregate or precipitate which is accumulated inside the cells and contains the sialyttransferase. 
In order to expose the proteinic aggregate or precipitate to outside of the microorganisms for its separation, the cultured 
40 transformants can be treated with, for example, lysozyme or Triton X-100 and thai insoluble fractions may be collected 
by centrifugation. After then, the precipitates are suspended in a buffer (for example, 10 mM Tris-HCI, pH 7.4) at a pro- 
tein concentration'of about 1 to 1 0 mg/ml and are subjected to extraction with urea. 

For example, solid urea is added to the suspension so as to be 5 to 10 M, preferably 8 M of final concentration, and 
the precipitates are subjected to extraction for 1 5 minutes to 2 hours, preferably 30 minutes at 4 to 25 °C, preferably at 
45 10°C. While not bound by any specific theory, the hydrophobic portion of a sialyttransferase contained in the extract is 
exposed by the action of urea, and as a resutt, a solubiiized sialyttransferase is extracted from the proteinic aggregates 
or precipitates. 

Then, an extract solution containing a denatured sialyttransferase can be obtained by removing the precipitates by, 
for example, centrifugation of the extract at 12,000 x g for 15 minutes. This extract normally contains about 0.5 mg/ml 
so of proteins. For example, when 5.7 M urea is used for the extraction, about 80% of proteins can be recovered. Further- 
more, upon the extraction, NaCt and Tris-HCI (pH 7.4) are preferably added so that their final concentrations of 0.3 M 
and 20 mM, respectively, are achieved. Exemplary procedure of the extraction will be explained in detail in the Exam- 
ples set out below. 

The sialyttransferase contained in the extract exposes hydrophobic portions and its higher-order structure is dam- 
55 aged. According to the process of the present invention, renaturation of the sialyttransferase contained in the extract is 
performed as the third step. The term renaturation herein used means restoration of the higher-order structure of the 
protein that is lost during the extraction step and the entire or partial recovery of the enzymatic activity. This step is char- 
acterized in that the extract is diluted stepwise with a renaturation composition so that the urea concentration can be 
gradually lowered to efficiently achieve the renaturation of the sialyttransferase. 
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The renaturation process comprises the steps of, for example, diluting the extract with a renaturation composition 
to obtain a primary dilution containing about 1 to 4 M urea; diluting the primary dilution with a renaturation composition 
to obtain a secondary dilution containing about 0.5 to 2 M urea; and removing the urea from the secondary dilution by 
dialysis to afford a renatured sialyltransferase.' 

5 A preferred embodiment of the process comprises the steps of, for example, diluting the extract with a renaturation 
composition to obtain a primary dilution containing about 2 to 3 M urea; diluting the primary dilution with a renaturation 
composition to obtain a secondary dilution containing about 1 to 2 M urea; and removing the urea from the secondary 
dilution by dialysis in the presence of one or more divalent cations to afford a renatured sialyltransferasa A further pre- 
ferred embodiment is a process comprises the steps of, for example, diluting the extract with a renaturation composition 

10 and the result is allowed to stand for 1 2 hours or more at 4°C to obtain a primary dilution containing about 2 to 3 M urea; 
diluting the primary dilution with a renaturation composition and the result is allowed to stand for 48 hours or more to 
obtain a secondary dilution containing about 1 to 2 M urea; and removing the urea from the secondary dilution by dial- 
ysis in the presence of one or more divalent cations to afford a renatured sialyltransferasa 

As the renaturation composition, for example, 2 M urea, 20 mM MOPS-NaOH (MOPS: 3-morpholinopropanesul- 

is fonic acid) (pH 7.0), 0.5M NaCI, io mM lactose, 0.5 mM EDTA; and 2 M urea, 20 mM Tris-HCl, 0.3M NaCI, 20 mM lac- 
tose, 0.5 mM EDTA (pH 7.4) may be used. In addition, a modified composition may be used in which the components 
of the latter composition may be changed to, for instance, 20 mM Tris-HCl (pH 8.0); 20 mM MOPS-NaOH (pH 7.0); 20 
mM MES-NaOH (pH 6.0) (MES: 3-morpho!inoethanesulfonic acid); 0.5 M NaCI; 0.1 M NaCI; or 1M urea. Furthermore, 
compositions not containing urea or lactose may also be used. Among these, 2 M urea. 20 mM MOPS-NaOH, 0.5 M 

20 NaCI, 20 mM lactose, 0.5 mM EDTA (pH 7.0) is preferably used. When the concentration of NaCI is below 0.1 M, or pH 
exceeds 9, renaturation efficiency is undesirably reduced. Generally, a salt concentration of 0.3 to 0.5 M and pH of 6 to 
8 are preferred after the addition of the renaturation composition. 

The first dilution comprises the step of preparing a primary dilution using the aforementioned renaturation compo- 
sition so that a final protein concentration of the extract is 0.01 to 0.05 mg/ml, preferably about 0.02 mg/m). For example, 

25 the extract may be diluted 10 to 40-fold, preferably about 20-fold, and a urea concentration may be 1 to 4 M, preferably 
not higher than about 3 M and not lower than about 2 M. Dilution treatment is generally and preferably performed at 
4°C. This primary dilution mixture is left stand for 12 hours or more at 4*C, most preferably for about 12 hours, to initiate 
gradual renaturation. 

The secondary (filution is carried out by diluting the primary dilution with an equal volume of renaturation composi- 
30 tion, preferably not containing urea, to achieve approximately the half urea concentration. Through this dflution, urea 
concentration of the secondary dilution should be lowered to about 0.5 to 2 M, preferably not higher than about 2 M and 
not lower than 1 M (e.g., 1 to 2 M), and most preferably at about 1.2 M. The secondary dilution is allowed to stand for 
40 hours to 2 weeks, preferably 48 to 72 hours, most preferably about 48 hours at 4°C, to proceed gradual renaturation. 
After then, to achieve perfect renaturation, the above-obtained secondary dilution is dialyzed against for example, 
55 a renaturation composition free from urea to completely remove remaining urea. The dialysis may be carried out at 4 
• °C for about 48 hours. Dialysis solution may be, for example, any one of buffer solutions in which the sialyftransf erase 
can be stored stably, as well as the renaturation composition. 

In addition, by carrying out the primary and secondary dilution and the final dialysis in the presence of one or more 
divalent cations, renaturation efficiency can be further improved. Examples of the divalent cations include, for example, 
40 magnesium ions and manganese ion& These ions may be used at a concentration of 1 to 10 mM, preferably about 5 
mM. It is particularly preferred that the dialysis is performed in the presence of one or more divalent cations. When a 
reducing agent such as dhhiothreftol and mercaptoethanol is added before complete removal of urea in the final dialysis 
step, the enzymatic activity may occasionally be lost However, after the urea is completely removed, the enzyme 
restores resistance to the reducing agent to exhibit the siafyltransferase activity. 
45 The present invention will be further explained more specifically by referring to the following examples. However, 
the scope of the present invention is not limited to these examples. 

Examples 

so (A) Preparation of GalNAca2.6-sialvltransferase P-B1 

In order to obtain acDNA clone of GalNAco2,6-sialyltransferase P-B1 , PCR with two degenerate oligonucleotides 
(ST-1 07 and ST-205) was performed using chick embryo cDNA as a template. A fragment of the desired size of approx- 
imately 150 bp was obtained. Among the PCR recombinants, one clone, designated as CEB1, was found to have a 
55 unique amino acid sequence distinct from the known sialyl motifs of GaI£4GlcNAc-a6STRL (residues 180-225), 
Gaip3(4)GlcNAc-a3STRL (residues 158-203), and Gaip3GalNAc-a3STPS (residues 144-189). The homologies of the 
sialylmotif of CEB1 with those of Galp4G!cNAc-a6STRL. Gaip3(4)GlcNAc-a3STRL and Gaip3GaJNAc-o3STPS were 
55%, 58% and 60%, respectively. 
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Screening of a 6-day-o!d chick embryo cDNA library with the cDNA insert from the CEB1 was carried out and as 
a result several cDNA dones were identified. Among them, clone XCEB-3043 contained a 2.7 kb insert (Fig. 1). To 
obtain other overlapping clones, a random-primeid cDNA library was again screened by hybridization with the 0.8 kb 
EcoRI-Bglll fragment of the S'-end of the XCEB-3043. Fifteen clones were isolated from the cDNA library. Among them, 

5 one clone, XCEBHAD contained a 220 bp insert overlapping with the 5 v -end of d on e XCE B-3 043 for 1 60 bp. . 

The combined DNA from these two cDNAs contained a 1 .7 kb of open reading frame that ends at a TGA terminal 
codon at nucleotide 1699. A poly adenylation signal (AATAAA) at 23 nucleotides upstream from the poly(A) sequence 
exists at the 3'-end. Translation of this open reading frame affords QaiNAca2 l 6-sialyltransferase P-B1 of the present 
invention (occasionally referred to as P-B1 in the the examples) of 566 amino acids with a molecular mass of 64,781 , 

io which starts with a methionine codon at nucleotide 1 with a conventional initiation sequence (Kozak, M., Nature (Lend.), 
308, 241-246, 1984). The cDNA including a gene encoding the GalNAca2,6-sialyttransferase of the present invention, 
the nucleotide sequence of XCEB-3043 as being the gene encocfi ng the GalNAca2,6-sia!yltransferase of the present 
invention, and the amino acid sequence of the GaINAca2,6-sialyKransferase P-B1 of the present invention are shown 
in the SEQ ID No.1 of the sequence listings. 

15 

Polymerase chain reaction (PCR) 

PCR was performed using degenerate primers [5* primer ST107: TGGGCCTTCMIKA/CJAGGTQTGCTQTTG, and 
3* primer ST205: AGGCGAATGGTAG inn G(A/T)GCCCACATC] deduced from conserved regions in Galp4GlcNAc- 
20 a6STRL (Weinstein, J. et a!.. J. Biol. Chem., 262, 17735-17743. 1987), Gaip4GlcNAc-a6STXP (Grundmann, U et al., 
Nucleic Acids Res., 18, 667, 1990), and Galp3GalNAc-a3STPS (Gillespie, W. et al., J. Biol. Chen.. 267, 21004-21010, 
1992). To obtain cDNA, poly(A)-rich RNA (2 fig) from 3 day-old chick embryos was incubated with an oligo-dT primer 
(Pharmacia), 1 mM each of dATP, dCTP. dGTP and dTTP, and 2 U/jil of RNase inhibitor (Promega) in 10 mM Tris-HCI 
(pH 8.3), 50 mM KG, 1 .5 mM MgCi 2 and 0:001 % gelatin in 50 nJ for 1 0 min at 0 °C, and then for further incubation was 
25 carried out for 60 min at 42°C after the addition of 1 00 nil Moloney murine leukemia virus reverse transcriptase (BRL). 
After heating the reaction mixture at 94°C for 3 min, cDNA prepared from 0.2 jig of poly(A)-rich RNA was used for 
the PCR experiment in a mixture comprising 10 mM Tris-HCI (pH 8.3), 50 mM KCi, 1 .25 mM MgCI 2 0.001% gelatin, 200 
jiM of each dATP, dCTP, dGTP and dTTP, 2U of Taq DNA polymerase (Promega), and 40 pmoles of each PCR primer 
in 50 PCR amplification, 35 cycles, was carried out each cycle consisting of denaturation at 96°C for 45 sec anneal- 
so ing at 50°C for 60 sec, and extension at 72°C for 60 sec. The PCR products were developed on a 3% agarose gel. The 
DNA fragment corresponding to 150 bp was eluted from the ge) (Qiaex kit; Qiagen), blunt-ended and kinated, and then 
subcloned into the Smal site of pUC1 1 9, and finally sequenced. 

. Construction of a cDNA library 

35 

Total RNA was prepared from chick embryos (6-day-old) by the guanidinium thiocyanate method, followed by cerv 
trifugation in a 5.7 M CsCI solution (Sambrook, J., Molecular Cloning: a Laboratory Manual, 2nd ed. Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y. 1989). Poty(A)-rich RNA was purified with oligotex-dT30 (Takara), and then 
employed for the construction of a cDNA library using AZAPII (Stratagene) and cDNA synthesis (Pharmacia) kits with 
40 an oligo-dT primer and random primers. 

Screening of the cDNA library 

The amplified cDNA library (1 x 10 6 plaques) was screened with the chick embryo PCR fragments. The plaque- 
45 transferred filters were hybridized with 32 P-racSolabeled DNA probes for 1 2 h at 65°C in 5 x SSC, 0.02% SDS, 5 x Den- 
hardt's solution and 10 jig/ml denatured salmon sperm DNA and then washed twice at 65°C tor 20 min in 2 x SSC, 
0.1% SDS. To obtain plasmids from the isolated phage clones, phagemid rescue was performed according to the man- 
ual of the manufacturer of the AZAPII cloning kit (Stratagene). cDNA inserts were excised directly as Bluescript plas- 
mids. Plasmids were produced by the standard molecular cloning method according to Sambrook et al. (Sambrook, J. 
so et al., Molecular Cloning: a Laboratory Manual). 

DNA sequence analysis 

The DNA sequences of the inserts were determined by the dideoocy-chain termination method (Sanger, F. et al., 
55 Proc. Natl. Acad. Sci. USA 74, 5463-5467, 1977) using angle-strand DNA as a template for T7-DNA polymerase. The 
sequencing reaction and electrophoresis were carried out using the Auto Read DNA sequencing kit and a DNA 
sequencer (Pharmacia). Single strand DNA was prepared from Escherichia coli XL-Blue (Stratagene) after superinfec- 
tion with helper phage R408 (Stratagene). The sequence data were analyzed with a computer using PC/Gene (Teijin 
System Technology). . 
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Northern and Southern blot analyses 

To confirm the existence of the gene, Southern blot analysis of chick genomic DNA was performed. Hybridization 
with the EcoRl cDNA insert of XCEB-201 gave a single band for the DNA digested with EcoRI and BamHI. and two 
5 bands for the DNA digested with Hindlll and Sacl. This simple hybridization pattern indicates that the cloned cDNA is 
a single copy gena 

The transcription pattern during embryonic development was examined by Northern Hot hybridization. Analysis of 
RNA from 6, 8 and 10 day-old chic* embryos revealed two RNA species of 3.0 and 22 kfcx The 3.0 kb transcript was 
abundant and constantly expressed during all embryonal stages. A low level of the 2.2 kb transcript was detected in 6 

10 day-old embryos and its expression was decreased in 8 and 1 0 day-old embryos. The gene expression was analyzed 
using 10 jig po!y(A)-rich RNA obtained from various chicken tissues: brain, heart liver, lung, kidney, and testis. Very low 
levels of the 3.0 and the 4.0 kb transcripts was detected in testes, while almost no signals were detected in other tis- 
sues. The following description details each of the experiments. 

For Northern blots, 5 \iq of denatured poly(A)-rich RN As from chide embryo was size-fractionated on formaldehyde- 

is agarose gels and then blotted onto Hybond N+ nylon membranes (Amersham). For Southern blots, 7.5 jig of genomic 
DNA prepared from chick embryos was digested with restriction enzymes EcoRI, BamHI, Hindlll and Sacl, and then 
size-fractionated on 0.6% agarose gels. After electrophoresis, the gels were denatured (30 min) in 0.5 N NaOH and 1 .5 
M NaCI and neutralized (30 min) in 0.5 M Tris-HCI (pH 7.5) and 1.5 M NaCI, and then the DNA was transferred onto 
Hybond N+ nylon membranes. Both Northern and Southern filters were prehybricfized in 50% formamide, 5 x SSC, 5 x 

20 DenhardTs. 0.5% SDS, and 10 jig/rnl denatured salmon sperm DNA at 37 °C for 1 h, and then hybridized with a 32 P- 
labelled DNA probe for 12 h under the same conditions as for prehybridization. The probe applied was a 0.6 kb EcoRI 
cDNA insert of XCEB-201 , which was labeled with a Multiprime Labeling System (Amersham). The filters were washed 
twice for 10 min at 65°C in 2 x SSC and 0.1% SDS. followed by washing twice with 0.2 x SSC and 0.1% SDS at 65°C 
for 30 min, and then exposed to Kodak XAR fOm for about one day at -70°C. 

25 The amino acid sequence of the sialyltransferase P-B1 of the invention, which was revealed as described above, 
shows the following characteristic features that are not observed in siatyltransferases so tor known. 

(i) All of the sialyftransf erases previously cloned are critical Type II membrane proteins. They have a domain struc- 
ture similar to that of other glycosyl-transferases: a short NH r terminal cytoplasmic tail; a hydrophobic signal- 
so . anchor domain; a proteolytically sensitive stem region; and a large COOH-terminal active domain. On the other 

hand, the sialyltransferase P-B1 of the invention has a large stem region (or intermediate region). 

(ii) The sialyltransferase P-B1 of the invention has a PEST region (residues 233-258). It has been known that the 
amino acid sequences of proteins with intracellular half-Bves of less than 2 hours contain one or more regions that 
are rich in proline, glutamic add, serine, and threonine residues (referred to as PEST: Rogers, S. et aL, Science, 

35 234, 364-368, 1 986). These PEST regions are generally flanked by clusters containing several positively charged 
amino acids. Other sialyftransf erases previously known do not have this region. 

(Hi) Two stretches of eight amino acids (SSSXVSTC) were found at residues 247-254 and 330-337. A search of the 
Genebank database for other proteins revealed no sequence similarity to this sequence. 

40 Sialyttransferases so far known exhibit remarkable tissue-specific expression, which seems to be correlated with 
the existence of cell type-specific carbohydrate structures (Paulson, J.C. and Colley, KJ., J. Biol. Chent, 264, 
pp.17615-17618, 1989). The results of Northern blotting indicates that the pattern of expression of sialyltransferase P- 
B1 apparently changes. The transcriptions of three different sizes of mRNAs (4.0, 3.0 and 22. kb) from the sialyltrans- 
ferase P-B1 gene suggests that they are generated through alternative splicing and alternative promoter utilization 

45 mechanisms as observed for Gaipi,4GIcNAca2,6-sialyltransferase (Gaip4GlcNAoa6STRL) - and 
Galp1,3(4)acNAca2,3-sialyltransferases (Gaip3(4)GlcNAc-o3STRU Weinstein, J. et al., J. Biol. Chem., 262, 17735- 
17743, 1987; and Wen, D.X. et al., J. Biol. Chem.. 267. 2101 1-21019, 1992). This hypothesis is supported by the results 
of Southern hybridization, which showed the existence of a single copy gene for sialyltransferase P-B1 . 

so IB) Preparation of the soluble form protein SB-690 

In order to utilize the QalNAca2,6-sialyttransfer activity of the Ga!NAca2,6-s'a!yftransferase P-B1 of the present 
invention, protein SB-690 in a soluble form was prepared which retains the activity of the present enzyme and is 
released from the cells upon expression. 
55 The siatyltransferases so far cloned have a domain structure similar to that of other glycosyl-transferases: a short 
NH2-terminal cytoplasmic tail; a hydrophobic signal-anchor domain; a proteolytically sensitive stem region; and a large 
COOH-terminal active domain. To determine the location of any transmembrane domain of GalNAca2,6-sialy»trans- 
f erase of tie present invention, a hydropathy plot (Fig.2) was prepared from the translated sequence according to the 
method of Kyte and Doolittle (Kyte, J. and Doolittle, R.F., J. Mol. Bio)., 157, 105-132, 1982). As as result, it is suggested 
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that a critical hydrophobic transmembrane domain of GalNAco2,6-siaiyltransf erase P-B1 of the present invention con- 
sists of 21 amino acid residues from the amino adid No. 17 to 37. 

As described above, the hydrophobic signal anchor domain of GalNAca2,6-sialyttransferase is lacated from amino 
add resudie No.17 to 37. Residues from 233 to 269 apparently contain certain essential residues for the enzymatic 

5 activity, because the media from ceils transfected with pcDSB-BGL had no significant activity, while the protein (33 KDa) 
was synthesized in an in vitro translation/transcription system with pSB-BGL as a template. The active domain was thus 
deduced to be around 233-566 (Fig.3), which is a comparative size to that of other cloned sialyttransferases. In order to 
produce the soluble protein containing the active domain described above, the sequence relating to the putative active 
domain of P-B1 was in-frame fused to the sequence of immunoglobulin signal peptide (Jobling, SA and Gehrke! L, 

10 Nature (bond.), 325, 622-625, 1987). Details of the experiments are shown below. 

A vector plasmid pUQS was constructed by replacing the Pstl-Xhol fragment of the p Bluescript SK(+) plasmid with 
a 1 17 bp of a synthetic DNA fragment This fragment contains 43 bp of the ^-untranslated leader sequence of Alfalfa 
Mosaic Virus (Jobling, SA and Gehrke, L, Nature (Lond.) 325, 622-625, 1987) with a synthetic Pstl site at the 5*-end, 
followed by the mouse immunoglobulin M heavy chain signal peptide sequence (57 bp) (Boersch-Supan, M.E. et al., J. 

is Exp. Med. 161 , 1272-1292, 1985) with a 17 bp of a synthetic EcoRI, Bglll and Xhol doning site at the 3*-end. The nucle- 
otide 

5 1 -CTGCAGGGTTTTTATTTTTAATTTTCTTO 
20 OTCCACCATGAAATTCAGCTGGGTCATGTTOT 
TTCCAGATCTCGAG-3 ' . 

25 sequence of this fragment is 

XCE B-3043 encoding GalNAca2,6-sialyftransf erase of the present invention was partially digested with EcoRV, and 
1.8 to fragment was subcioned into EcoRV site of pBluescript SK(+) to generate pCEB-1800. This done lacks 0.8 kb 
of 3 -untranslated region of XCEB-3043. An active domain of GalNAca2,6-sialyltran$ferase P-B1 was generated by 
PCR using the 5'-primer, 5-AGGGCTGCTGAATTCACTGAGCCACAG-3' (nudeotides 679-708), with a synthetic EcoRI 

30 site at the middle of the primer and a 3* universal M13 sequendng primer and pCEB-1800 as a template. The PCR 
product was digested with EcoRI and Xhol, and then ligated into the EcoRI/Xhol site of pUGS to yield the plasmid pSB- 
690. In this plasmid, a sequence obtained by in-frame fusion of the 3'-end of the immunoglobulin signal sequence to the 
putative active domain of GalNAca2,6-sialyRransferase P-B1 was contained. The fusion fragment was excised from 
pSB-690 with Pstl and Xhol, and then inserted into the Pstl/Xhol site of expression vector pcDSRato yield pcDSB-690. 

$$ As a control, protein SB-BGL which lacks the active domain of GalNAca2,&-stalyttransferase was produced as 
described below. pCEB-1 800 and pUGS were digested with Bglll, and the protruding ends were filled by using the We- 
now fragment of DNA polymerase. After heat denaturation of the Wenow fragment of DNA polymerase (at 94 °C for 20 
min), these plasrrods were digested with Xhol. The 1.0 kb fragment from pCEB-1800 was gel purified and subdoned 
into the blunt-ended BgJII/Xhol site of pUGS to yield pSB-BGL The Pstl/Xhol fragment from pSB-BGL was subdoned 

40 into the Pstl/Xhol site of pcDSRa to generate pcDSB-BGL 

Expression of the above described protein was performed as follows. COS-7 cells were transiently transfected with 
5 |tg of plasmid DNA using the DEAE-<textran method (McCutchan, J.H. and Pagano, J.S. J. Natl. Cancer Inst 41, 
pp.351-357,1968). The media were harvested after 48 h transfection and then concentrated 1 0 times on Centricon 30 
fitters (Amicon) for the enzyme assay. For metabolic labeling, COS ceRs (60-mm culture dish) were washed with Met- 

45 free medium (Dulbecco's modified Eagle's medium and 2% fetal calf serum) (GIBCO) and then incubated for 1 h with 
the same media The cells were pulse-labeled with 10 MBq/dish of Express protein labeling mix (Du Pont-New Eng- 
land Nudear) in 1 .5 ml of Met-free media for 2 h. These ceDs were then washed with Met-free media and chased for 5 
h in media without Express-label. The media containing secreted proteins were harvested, concentrated 10 times, and 
then subjected to SDS-PAGE, followed by fluorography. 

so The enzyme activity of the expressed protein was measured as follows. The assays using oligosaccharides and 
glycoproteins as acceptors were performed in the presence of 50 mM sodium cacodylate buffer (pH 6.0), 50 jiM CMP- 
[ 14 C-lNeuAc (0.9 Bqfcmol), 1 mgAnl of bovine serum albumin, 2 mg/ml of acceptor substrate and 1 pi of concentrated 
COS cell medium, in a final volume of 1 0 nJ, and were incubated at 30 c O for 2 h. At the end of the incubation period, 1 
pi of the assay mixture was applied to a Silica gel 60HPTLC plate (Merck, Germany). The plate was developed with eth- 

55 anol:pyridine:n-butanol:water:acetate (100:10:1050:3), and the radioactivity was visualized and quantified with a 
BAS2000 radio image analyzer (Fuji Photo Rim, Japan). The radioactivity remaining at the origin was taken as sia- 
lylated glycoprotein. 

Identification of the sialylated products was carried out as* follows. Asialo-BSM were resialytated with CMP- 
[ 14 C]NeuAc in pcDSBHB90 COS cell medium and p-etimination oligosaccharides were prepared, p-elimination was car- 
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ried out according to Carlson's method (Carlson, D.M., J. Biol. Chem., 243, 616-626, t968). Asialo-BSM (100 |tg each) 
was aalyiated with CMP-[ 14 C]NeuAc in pcDSB-690 COS cell mecfium under the same conditions as above, except that 
the incubation period was 12 h. The reaction was terminated by adding 500 jil of 1% phosphotungstic acid in 0.5M HQ, 
followed by centrifugation at 10,000 x g for 5 mia The pellets were washed once with the same phosphotungstic acid 

5 solution and once with methanol, dissolved in 0.5 ml of 0.05M NaOH and 1 M NaBH 4 , and then incubated 30 h at 45°C. 
At the end of the incubation period, the solution was neutralized with acetic acid to pH 6 and then lyophilized. The 
dehydrated products were dissolved in 50 yd of water, and then desalted by gel filtration on a Sephadex G-15 column 
(0.5 x 5 cm) equilibrated and etuted with water. The radioactive fractions were subjected to thin layer chromatography 
for identification of the products without further purification. NeuAca2 t 6GaiNAc-ol and GlcNAcpi ,3[NeuAca2 t 6]Gal- 

10 NAc-ol from native BSM in two different developing solvent were co-migrated. The ratio of the transferred sialyl residue 
was 1 :0.9:0.6. The results of the co-migration of Salylated GalNAc-SerNAc with with NeuAca2,6GalNAc-SerNAc in the 
two different solvent systems indicate tint the protein SB-690 of the present invention forms the NeuAca2,6 linkage to 
GalNAc that is directly attached to Ser or Thr residues in glycoproteins. 

Media from cells transfected with pcDSB-690 contained sialyttransferase activity and it provide strong evidence that 

is the protein SB-690 of the present invention expressed by pcDSB-690 was secreted out of cells while retaining sialyl- 
transferase activity. On the other hand, media obtained from cells transfected with cDSB-BGL had no sialyttransferase 
activity. 

The acceptor specificity of the protein SB-690 of the present invention was examined with the concentrated COS- 
7 cell culture medium transfected with pcDSB-690. As shown in Table 1, asialo-mucin, fetuin and asialo-fetuin served 
20 as good acceptors. Remarkably, fetuin was shown to be a better acceptor than asialo-fetuin (BauWchon-Cortay. H. et 
al., Carbohydr. Res., 149, 209-223. 1986; and Brockhausen. I. et ah, Biochemistry, 29. 10206-10212, 1990). Other glyc- 
oproteins, oligosaccharides and glyooGpids did not serve as acceptors, except GalNAc-SerNAc. These data suggest 
that the acceptor ate is GalNAc directly attached to Ser or Thr residues in glycoproteins through an a-glycostde linkage. 
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Table 1 






Acceptor specificity of the protein SB-690 of the invention 


Acceptor 


Pmoles/hr/10 nl medium 


30 


Fetuin 


142 




Asialo-fetuin 


96 




a1 acid glycoprotein 


6 


35 


Asialo-a1 acid glycoprotein 


4 


Bovine submaxillary mucin 


15 




Bovine submaxillary asialo-mucin 


186 




Ovomucoid 


7 


40 . 


Asialo-ovomucoid 


0 




GaJpi ,3GteNAcp1 ,3Galp 1 ,4Glc 


0 




GaJp1,4GlcNAc 


0 


45 


Galpi,3GalNAc 


0 




GalNAcpi,4Gal 


0 




Galpi,4Glc 
Galactose 


0 


SO 


Ganglioside mixture 


0 
0 




Ganglioside GD1a 


0 




GalNAc-SerNAc 


4 


55 


Benzyl -GalNAc 


2 



* A number of 0 indicates lass than 1 pmol/hr/10 \il medium. 
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So far cloned sialy [transferases only exhibit acceptor specfficity for the GaNmoiety. While the GalNAca2,6-sialyl- 
transf erase P-B1 and protein SB-690 of the present invention exhibit acceptor specificity for the GalNAc- but not the 
Gal-moiety. The following evidence supports that GalNAca2 l 6-sialyHransferase P-B1 and the protein SB-690 of the 
present invention have the activity of Ga!NAca2,6-siatyttransferase, which transfer CMP-NeuAc with an a2,6-linkage 
5 onto a GalNAc residue O-Rnked to Thr/Ser of a glycoprotein; 

(i) The expression of pcDSB-690 in COS cells reveals the remarkable acceptor specificity for only the GalNAc moi- 
ety bound to Ser/Thr residues, while no detectable enzyme activity was found toward the other substrates tested 
(Table 1). 

10 (ii) The sialylated products obtained from bovine submaxillary asialo-muctn and GalNAc-SerNAc were shown to 
have sialic add bound to the GalNAc moiety through an a2,6-linkage. 

The two types. La, bovine submaxillary gland- and liver (brain)- types, of GalNAc-a6ST were reported, which have 
the different acceptor specificity (Bergh, M.E. et al. a J. Biol. Chem., 258, 7430-7436, 1983). The former enzyme has the 

is broad specificity toward GalNAc, Gaipi,3GalNAc and NeuAca2,3Gaipi ,3GalNAc, whereas the latter has only toward 
NeuAca2,3Gaipi,3GalNAc moiety of glycoproteins. The acceptor specificities of the GaiNAca2,6-siaJyttransf erase P- 
B1 and the protein SB-690 of the present invention were found to be similar to that of the former enzyme 

Examination of the acceptor site of asialo-mudn showed that NeuAca2,6GalNAc-Ser/Thr was the most abundant 
product However, considering the ratio of glycoconjugates in bovine submaxillary asialo-mudn, i.e., GalNAc-Ser/Thr, 

20 GlcN Acpi ,3GalNAc-Ser/Thr. and Galpt ,3GalNAc-Ser/Thr amounted to 65%, 25%. and 5%, respectively (Tsugi, T. and 
Osawa, T., Carbohydr. Res., 151, pp.391 -402, 1986), Gal N Aca2,6-aalytoransferase P-B1 and the protein SB-690 of the 
present invention seem to have the following acceptor preference: Gal{H,3GalNAc-Ser/Thr > GIcNAcpi,3Ga!NAc- 
Ser/Thr > GalNAc-Ser/Thr. On the other hand, the facts that almost all the radioactivity was released on weak alkali 
treatment and that fetuin is preferred over asialo-fetuin (Table 1) indicate that NeuAca2,3GaIp1 ,3GalNAc-Ser/Thr is a 

25 preferred substrate over GaIp1,3GalNAa-Ser/Thr, as reported for calf liver (Bergh, M.E. et aL. J. Biol. Chem.. 258, 
7430-7436, 1983) and rat brain (Baubichon-Cortay, H. et a!., Carbohydr. Res., 143. 209-223, 1986) GalNAca2,6-sialyl- 
transferases. 

The sialylation of GalNAc-SerNAc was much slower than that of corresponding residues on astalo-mucin (Table 1). 
Brockhausen et al. (Brockhausen et. aJ., Biochemistry, 29, 10206-10212, 1990) showed that a length of at least five 

30 amino acid is required for efficient synthetase activity. A similar effect of the peptide portion directly on GaIN Aca2,6-sia- 
lyttransferase P-B1 and the protein SB-690 of the present invention is also suggested from this observation (Table 1). 

The regents and the lite used in the above preparation examples (A) and (B) were as follows: Fetuin, asialo-fetuin, 
bovine submaxillary mucin, a1-add glycoprotein, galactose pi ,4-N-acetylgalactosamine, CMP-NeuAc, lacto-N-tetra- 
ose, benzyl-Gal NAc, N-acetyllactosamine, and Triton CF-54 were obtained from Sigma (St Louis, USA). 

35 CMP( 14 C]NeuAc(1 1 GBq/mmole) was obtained from Amersham (U.K.). ^Acetylgalactosamine p1,4-gatactose was a 
gift from Dr. Kapmoto (The Institute of Physical and Chemical Research, RIKEN, Wako-shi, Saitama-ken, Japan). 2- 
Acetamide and 2-deoxygalactosyl-aN-acetylserine (GalNAc-SerNAc) were synthesized according to Grundler and 
Schmidt (Grundler G., and Schmidt R.R., Liebigs Ann. Chem., 1984, 1826-1847, 1984). NeuAca2,6-GalNAc-SerNAc 
was prepared from NeuAccx2,6GalNAc-Ser (MECT) by acetylatibn with anhydroacetate in pyridine-water. 

40 NeuAca2,6GalNAc-ol and GlcNAcpi ,3[NeuAca2,6]GalNAc-ol were prepared from bovine submaxillary mucin accord- 
ing to Tsuji and Osawa (Tsuji, T. and Osawa T„ Carbohydr. Res., 151 , 391 -402, 1986) and identified by 270MHz 1 H and 
13 C NMR (Savage, A.V. et al., Eur. J. Biochem.. 192, pp. 427-432, 1990; and Savage, A.V. et al.. Eur. J. Biochem.. 193, 
837-843. 1990). Synthetic primers were synthesized with the Applied Biosystem 394 DNA synthesizer. Restriction 
endortucleases Smal, EcoRI, BamHI, Hindlll, Sacl, Xhol, Bglil and Pstl were from Takara (Japan). 

45 

(C) Preparation Pf GalNAccg.S-sialyttrangferasg P-P3 

In order to obtain cDNA clones of GalNAcot2,6-sialyltransferases, PCR with two degenerate oligonucleotides (ST- 
107 and ST-205) was performed with chic* embryo cONA as a template. The fragment of the desired size of approxi- 
50 mately 150 bp was purified by agarose gel electrophoresis. As a result of sequencing of the PCR products, it was 
revealed that they included those encoding Galp1,4GIcNAca2,6-sialyltransferase (Kurosawa. N., et al.. Eur. J. Bio- 
chem, 219, 375-381, 1994) and GalNAca2,6-sialyltransferase P-B1, as well as a PCR product encoding a novel amino 
acid sequence, pCRB3. The identity of the sialylmotif of pCRB3 with those of above-mentioned sialyltransferases was 
65 through 57%. 

55 In order to identify the complete coding sequence of the gene, a young chicken testis cDNA library was screened 
with the cDNA insert of pCRB3. The screening about 5 x 10 5 independent clones yielded one positive clone, XCEB3- 
T20, which has an insert size of 2.05 kb. 

The nucleotide sequence of the cDNA clone included an open reading flame of 1212 bp, coding for 404 amino 
acids with a molecular mass of 45.8 kDa. The open reading frame starts with a methionine codon at nucleotide 1 , with 
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a conventional translation initiation sequence (Kozak. M. Nature, 308, 241-246. 1984), and ends with a TGA stopcodon 
at nucleotide 1213. The open reading flame is flanked by a 5'-untrans!ated sequence of 384 bp and a 3'-untrans!ated 
sequence of 451 bp. The DNA sequence 5* of the initiation ate contains stop codons in all three reading frames. The 
nucleotide sequence and deduced amino acid sequences of XCEB3-T20 are shown in the SEQ ID No.3 of the 

5 sequence listings. The Ga]NAca2,6-stalyttransferase having this amino acid sequence was designated as P-B3. 

This GalNAca2,6-sialyltransf erase P-B3 (when the GalNAca2,6-siafyftransferase P-B1 is referral , to as 
ST6GaINAcA this enzyme is occasionally referred to as ST6QalNAcB) has type II transmembrane domain, containing 
a 17-amino acid N-terminal hydrophobic sequence bordered by charged residues, as has been found for all sialyltrans- 
f erases cloned to data Comparison of the primary sequence of GalNAco2,6-sialyltransferase P-B3 with other amino 

io acid sequences in DNA and protein data banks revealed similarities in two regions to all of the cloned sialyltransf erases. 
One region (sialytmotif L) in the center of the GaINAca2,6-sialyttransferase P-B3, consisting of a 45 amino acid 
stretch, shows 64-24% sequence identity, whereas the other, in the COOH-terminaJ portion (sialylmotif S, residues 333- 
355), exhibits 78-43% identity. The overall amino acid sequence identity of GalNAca2.6-sialyltransferase P-B3 is 10% 
to cW*Gaipi,4G]cNAa2,6-sialyltransferase (Kurosawa, N.. et aL. Eur. J. Biochem., 219, 375-381, 1994), 13% to chick 

is Gaipi ,3GalNAca2,3-sialyltransferase (Kurosawa N. et al., Biochem. Biophys. Acta., 1244, 216-222, 1995), and 32% to 
chick ST6GalNAcA (22), respectively. These results suggest that the cloned gene belongs to the sialyltransferase gene 
family. 

Details of the experiments are as follows. 

20 Polymerase chain reaction (PCR) 

PCR was performed using degenerate primers [5 1 primer ST107: TGGGCCTTGGII(A/C)AGGTGTGCTGTTG, and 
3' primer ST-205: AGGCGAATGGTAG 1 1 1 1 1 G^ATQGCCC ACATC] deduced from conserved regions In Gaip4GJcNAc- 
ofiSTRL (Weinstein, J. et aL. J. Biol. Chem., 262, 17735-17743. 1987), Galp4GlcNAc-a6STHP (Grundmann, U et al., 

25 Nucleic acids Res., 18, 667, 1990), and Galp3GalNAo-cflSTPS (Gillespie, W. et al.. J. Biol. Chem., 267. 21004-21010. 
1992). To obtain cDNA, poly(A)-rich RNA (2 jig) from 3 day-old chic* embryos was incubated with an oligo-dT primer 
(Pharmacia), 1 mM each of dATP, dCTP, dGTP and dTTP, ami 2 U/pl of RNase inhibitor (Promega) in 10 mM Tris-HCI 
(pH 8.3), 50 mM KG, 1.5 mM MgCfe and 0.001% gelatin in 50 pJ for 10 min at 0 °C, and then for additional 60 min at 
42°C after the addition of 1 00 \iU Moloney murine leukemia virus reverse transcriptase (BRL). 

30 After heating at 94 °C for 3 min, cDNA prepared from 0.2 pg of poly(A)-rich RNA was used for the PCR experiment 
in a mixture comprising 10 mM Tris-HCI (pH a3), 50 mM KCI. 1.25 mM MgCfe. 0.001% gelatin, 200 jiM of each dATP, 
dCTP, dGTP and dTTP, 2U of Taq DNA polymerase (Promega), and 40 pmoles of each PCR primer in 50 pi. PCR ampli- 
fication, 35 cycles, was carried out, each cycle consisting of denaturation at 96°C for 45 sec, annealing at 50 °C for 60 
sec, and extension at 72°C for 60 sec. The PCR products were developed on a 3% agarose gel. The DNA fragment 

35 corresponding to 1 50 bp was eluted from the gel (Qiaex kit; Qiagen), blunt-ended, Wnated, and then subcloned into the 
Smal site of pUC1 19, and finally sequenced. 

Construction of a cDNA library 

40 Total RNA was prepared from chick embryos (6 day-old) by the guanidinium thiocyanate method, followed by cen- 
trifugation in a 5.7 M CsQ solution (Sambrook, J., Molecular Cloning: a Laboratory Manual, 2nd edition). Poly(A)rich 
RNA was purified with OIigotex-dT30 (Takara), and then employed for the construction of a cDNA library using X2APII 
(Stratagene) and cDNA synthesis kits (Pharmacia) with an oligcxfT prima' and random primers. 

4$ Screening of the cDNA library 

The amplified cDNA library (1 x 10 6 plaques) was screened with the chick embryo PCR fragments. The plaque- 
transferred fflters were hybridized with ^P-racBolabeled DNA probes for 12 b at 65°C in 5 x SSC. 0.2% SDS, 5 x Den- 
hardt's solution, and 10 pg/ml denatured salmon sperm DNA The filters were then washed twice at 65°C for 20 min in 
so 2 x SSC, 0.1% SDS. To obtain plasmids from the isolated phage clones, phagemid rescue was performed according to 
the instructions of the manufacturer of the AZAPII cloning kit (Stratagene). cDNA inserts were excised directly as Blue- 
script plasmids. Plasmids were produced by the standard molecular cloning method according to Sambrook, et al. 
(Sambrook, J. et al., Molecular Cloning: a Laboratory Manual, 2nd ed.). 

55 DNA sequence analysis 

The DNA sequences of the inserts were determined by the dideaxy-chain termination method (Sanger. F. et al., 
Proc. Natl. Acad. Sci. USA 74, 5463-5467, 1977) using single-strand DNA as a template for T7-DNA polymerase The 
sequencing reaction and electrophoresis were carried out using an AutoRead DNA sequencing kit and a DNA 
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sequencer (Pharmacia). Single Strand DNA was prepared from Escherichia coli XL-Blue (Stratagene) after superinfec- 
tion with helper phage R408 (Stratagene). The sequence data were analyzed with a computer using PCTGene (Teijin 
System Technology). 

To confirm the existence of the gene, Southern blot analysis was performed for chicken genomic DNA. Hybridiza- 

5 tion of the cDNA insert of pCRB3 for chicken genomic DNA gave a single band on digestion with EcoRI and two bands 
with BamHI. This simple hybridization pattern indicates that the cloned cDNA was a single copy gene. Southern blot 
analysis of genomic DNA from mouse and monkey with the pCRB3 probe under low stringency conditions suggested 
that this gene is conserved across species. For Southern blot, each 7.5 jig of genomic DNA prepared from mouse brain, 
COS-7 cells and chicken testes were digested with restriction enzyme and then size-fractioned on 0.8% agarose gels. 

10 The mRNA size and distribution of the GaINAca2,6-sialyltransferase P-B3 gene were determined by Northern blot 
analysis. Analysis of RNA from 3, 6, 8, 10 and 12-day old embryos revealed two RNA species of 4.5 kb and 2.2 kb. The 
4.5 kb mRNA was expressed abundantly at all embryonic stages examined, while not expressed in adult tissues. The 
less abundant 2.2 kb mRNA was expressed at the early embryonic stage, being abundant at the late embryonic stage 
and in adult tissues. The size of the 2.2-kb transcript suggests that the obtained cDNA clone (XCEB3-T20) was dose 

is to full length. For Northern blots, 5 fig of poly(A)-rich RNAs from chick embryo and 1 0 jig of all RNA from chicken tissues 
were size-fractioned on formaldehyde-agarose gels. 

Sialyltransferases previously known exhfoit remarkable tissue-specific expression, which is considered to be corre- 
lated with the existence of cell type-specific carbohydrate structures (Paulson, J.C. and Colley, KJ. P J. Biol. Chem., 264, 
pp.1 7615-17618, 1989). The results of Northern blotting indicate that the pattern of expression of sialyftransferase P- 

20 B3 changes. The precise structure of embryo-specific 4.5 kb mRNA has not been known. However, the production of 
two different sizes of mRNAs from the sialyltransferas e P-B3 gene suggests that they are very likely to be generated 
through alternative splicing and alternative promoter utilization mechanisms as observed for Gaipi ,4GjcNAc-<x2,6-sia- 
lyltransferase (GaIp4GlcNAc-a6STRL) and Galp 1,3(4)GlcNAca2 l 3-sialyltransf erase (Galp3(4)GlcNAc-a3STRL) 
(Weinstein, J. et al., J. Bid. Chem., 262, 17735-17743, 1987; and Wen, DX et al., J. Biol. Chem., 267, 21011-21019, 

25 1992). This hypothesis is supported by the results of Southern hybridization, which showed the existence of a single 
copy gene for sialyttransferase P-B3. 

A 1.3 kb DNA fragment encoding the full length sialyltransf erase P-B3 was amplified using synthetic oligonucle- 
otide primers (5 , -ACXSGCG£aSSQ^CCAACCCGQAQAGCAQ(X3^^ and 5'-CGTTGC dGQAQAGTCCTTGCAGT- 
GGGACT-3\ synthetic Xhol site underlined). The amplif ied DNA fragment was digested with Xhol and inserted into the 

30 Xhol site of the expression vector pcDSRa(Takebe, Y.Mol. Cell. Biol., 8, pp.466-472, 1988) to yield recombinant plas- 
mid pcDB3ST. The insert of the plasmtd was sequenced to confirm the absence of possible polymerase chain reaction 
errors. 

COS-7 cells were transfected with 5 |xg of the recombinant plasmid pcDB3ST using the DEAE-dextran method 
(McCutchan, J.H. and Pagans J.S.. J. Natl, Cancer Inst., 41, 351-357, 1968). 

35 After 48 h of the transaction, the cultured cells (1 x 1 0 7 ) were harvested, washed with phosphate-buffered saline, 
and then resuspended in 2 ml of buffer comprising 20 mM MnCfe and 25 mM MES, pH 6.0. The cell suspension was 
centrifuged at 30,000 x g for 30 min, the cell pellet was resuspended in 0.5 ml or 1% Triton X-1 00, 50 mM NaCI, 5 mM 
MnCI 2 , 25 mM MES, pH 6.0, and then subjected to sonication. After centrifugation at 30,000 x g for 30 min, the super- 
natant was concentrated 10-fold on Centricoh 30 filters (Amicon), and then used for following assays. 

40 The enzyme assays with glycoproteins, oligosaccharides and glycolipids as acceptors were performed in the pres- 
ence of 0.1 M sodium cacodylate buffer (pH 6.0), 10mMMgCI 2 , 0.5% Triton CF54, 12jiMCMP-[ 14 qNeuAc(1.5kBq), 
1 mg/ml acceptor substrate, and 1 of COS cell lysate (in a final volume of 1 0 nJ), with incubation at 37 °C for 1 hr. At 
the end of the incubation period, the reaction mixtures were subjected to SDS-PAGE for glycoproteins as acceptors, or 
were subjected to chromatography on HPTLC plates (Merck, Darmstadt, Germany) with a solvent system of ethanol/1 - 

45 butanol/pyridine/acetic acid/water (1 00:1 0:1 0:3:30) for oligosaccharides and glycolipids as acceptors. Sialylated accep- 
tors were quantified with a BAS2000 radio image analyzer (Fuji Photo Film, Japan). 

Identifications of sialylated products were as follows. Reduced oligosaccharides were obtained from resialytated 
glycoproteins by ^-elimination as described by Carlson (Carlson, D.M., J. Biol. Chem., 243, pp61 6-626, 1968). 
AsialoBSM was sialylated with CMP-{ 14 C]NeuAc in a pcDB3ST-transfected COS-7 cell lysate under the same condi* 

so tions as above. The radiolabeled oligosaccharides released from fetuin were digested with NDV sialidase, and then 
subjected to thin layer chromatography for identification of the products without further purification. Oligosaccharides 
released from BSM were used as standards. AsialoBSM and asialofetuin were [ 14 C]-sialylated with the GalNAca2,6- 
sialyl transferase P-B1 and Gaipi ,3GalNAca2,3-sialyltransferase (Lee, Y.-C., et a!., Eur. J. Biochem., 216, pp. 377-385, 
1993). respectively, and the oligosaccharides were prepared by p-eliminatioa The resulting [ 14 qNeuAca2,6GalNAc- 

55 ol, Gaipi,3([ 14 C]NeuAcc2,6)GalNAc-d and [ 14 C]NeuAca2,3Gaipi,3GalNAc-ol were used as radio-labeled standards. 
When fetuin was used as the acceptor, the acceptor was only sialylated by the lysate of COS-7 cells transfected 
with pcDB3ST. The expressed Gal N Aca2 ,6-salyttransf erase P-B3 exhibited strong activity toward fetuin and asialofe- 
tuin, and weak activity toward asialoBSM, whereas no significant activity was observed toward BSM or other glycopro- 
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teins having only N-glycosidically linked oligosaccharides (e.g., a1-acid glycoprotein, ovomucoid, asia!b-a1 acid 
glycoprotein and asialo-ovomucold) (Table 2). 

In addition, ofigosaccharides or glycosphingolpds could not serve as acceptors for the GalNAca2,6-sialyJtrans- 
ferase P-B3 of the present invention. ( 14 C]NeuAc residues incorporated into fetuin by the enzyme were resistant to 

5 treatment with N-glycanase or NDV sialidase. The radiolabelled oligosaccharides released from fetuin were co- 
migrated with Gaipl ,3(NeuAca2,6)GalNAc-ol after treatment with NDV sialidase. These results indicate that sialic add 
residues were transferred through a2,6-linkages on GalNAc residues of Oglycosidically linked oligosaccharides of 
fetuin.' Thus, the expressed enzyme apparently has GalNAca2,6-siaIyltransferase activity. However, asialoBSM was a 
much poorer acceptor than fetuin and asialofetuin for this GalNAca2,6-sialyttransferase P-B3 of the present invention, 

10 The acceptor substrate specificity is different from that of the GalNAco2,6-sialyltransferase P-B1 for which asialoBSM 
serves as a much better acceptor than asialofetuin. 

To define the substrate specificity of the GalNAca2,6-sialyttransferase P-B3 of the present invention, fetuin was 
sequentially treated with sialidase (Vibrio cholerae) and p-galactosidase (bovine testes), and the resulting asialofetuin 
and agalacto-asialofetuin were used as acceptors. The incorporation of NeuAc-residues for the sialidase-treated fetuin 

75 was increased 1.5-fold of that for native fetuin. Three Oglycosidically Onked oligosaccharides are known to be con- 
tained in fetuin, two of which are NeuAca2,3Gaipi ,3GalNAc and the other is NeuAca2,3Gaip 1 ,3(NeuAca2,6)-GalNAc 
(Spiro, R.G. and Bhpyroa V. D., J. Biol. Chem., 249, 5704-571 7, 1 974). Accordingly, GalNAc residues in two of the three 
Olinked oligosaccharides can serve as acceptors in native fetuin, whereas those in all O-linked oligosaccharides in 
asialofetuin can be sialylated by the GalNAca2,6-sia)yftransferase P-B3 of the present invention. 

20 Furthermore, agalacto-asialofetuin could not serve as an acceptor of the GalNAca2,6-sialyltransferase P-B3 of the 
present invention, and only Qaipi,3([ 14 C]NeuAca2,6)"QalNAc-ol, but not [ 14 C]NeuAca2,6-QalNAc-ol, was detected for 
the oligosaccharides released from asialoBSM incubated with the enzyme by p-elimination. 

The characteristics of the GalNAca2,6-sialyttransferase P-B3 of the present invention revealed by the above exper- 
iments can be summarized as follows: 

25 

(1-i) Fetuin and asialofetuin, which contain the Oglycosidically linked (NeuAca2,3)Gaipi.3GalNAc sequence 
(Spiro, R.G. and Bhoyroo, V.D., J. Bio. Chem., 249, 5704-5717, 1974), served as good acceptors, but asialoBSM, 
in which only 5% of the total carbohydrate chains contain Galpi ,3GalNAc sequences (Tsuji, T. and Osawa, T, Car- 
bohydr. Res., 1 51 , 391 -402, 1 986), served as a much poorer acceptor; and 
so (1-0) the protein portion is essential for the activity of this sialyttransferase, since Gaipl ,3GalNAca1 -Bz as well as 
asialoGMt (Qalp1.3GalNAcpi,4Gaipi.3Glcpi-Cer) and GM1b (NeuAco2,3Qaipi,3GalNAcp1,4Gaipi,3Glcpi- 
Cer) did not serve as acceptors. 

(2) This sialyttransferase did not exhibit activity toward asialofetuin treated with p-galactosidase (agalacto-asialofe- 
tuin). 

35 (3) Only Galpi ,3Q 14 CJNeuAG-a2,6)Ga!NAc-ol was detected In the oligosaccharides released from [ 14 C]sialylated 
asialoBSM although about 60% of the carbohydrate chains of asialoBSM are GalNAc-O-Ser/Thr (Tsuji, T. and 
Osawa, T, Carbohydr. Res., 151, 391-402, 1986). 

These results clearly suggest that the acceptor substrate of the enzyme of the present invention having catalytic 
40 activity, i.e., transfer of CMP-NeuAc with an a2,6-linkage onto a GalNAc residue Olinked to Thr/Ser of a glycoprotein, 
requires Gaipi,3 GalNAc sequence of Oglycoside Onked oligosaccharide, whereas a2,3 (inkage-sialic acid residues 
linked to galactose residues are not essential for the activity. Therefore, the enzyme P-B3 first cloned by the present 
invention is a novel type of GalNAca2,6-sia!y!transferasa The primary sequence of GalNAca2,6-sialyttransferase P-B3 
from the 45 amino add regions at the molecular center (sialyl motH L) to the COOH-tenminal (residues: 1 80-404) exhibits 
45 high sequence homology to that of GaJNAca2,6-siaIyttransferase P-B1 (Fig. 4: the identity is 48%). The conserved 
regions unique to these GalNAco2,6-aaIyltransferases may be correlated with their enzymatic function of transferring 
sialic acid to the GalNAc-moiety via an <z2,6-tinkage. 
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Table 2 



Acceptor substrate specificity of GalNAca2,6-siafyltrans- 
ferase P-B3 of the invention 


Acceptor 




Specificity 


pmol/h/pJ enzyme fraction 


Fetuin 




28 


Asialcfetum 




35 


BSM 




0.5 


AsialpBSM 




5.2 


a1 -Acid glycoprotein 




0 


Asiaio-ai-aacj glycoprotein 




1.2 


Ovomucoid 




0 


Asialo-ovomucoid 




1.0 


Gaipi,3GalNAca1«Bz 




0 


GaiNAca1-Bz 




0 


GalNAc-SerNAc 




0 


AsialoGMI 




0 


GM1b 




0 


Gangliostde Mixture 




0 


0 indicates less than 0.5 pmol/h. 



The regents, samples and the like used in the above preparation example (C) were as follows. Fetuin, asialofetuin, 
35 bovine submaxillary mucin, a1-actd glycoprotein, galactose p1 ^-acetylgalactosamine, CMP-NeuAc. 
Gaipi^GalNAcal-Bz, GalNAca1-Bz and Triton CF-54 were obtained from Sigma (St Louis, USA). CMP-[ 14 qNeuAc 
(1 1 GBq/mmole) was obtains from Amersham (U.K.). 2-Acetamide and 2-deoxygalactosylaN-acetylserine (Gal N Ac- 
SerNAc) was synthesized according to Gruncfler and Schmidt (Grundler G., and Schmidt R.FL, Uebtgs Ann. Chem., 
1984, 1826-1847. 1984). NDV-sialidase ami siaTidase from Vibrio cholerae were purchased from Oxford Glycosystems 
40 (U.K.) and Boehringer Mannheim (Germany), respectively. p-Galactosidase from bovine testes was obtained from Boe- 
hringer Mannheim (Germany). Synthetic primers were synthesized with the Applied Biosystem 394 DNA synthesizer. 
Restriction endonucleases were obtained from Takara (Japan) 

(D) Purification of siaMtransferase expressed in microorganisms 

45 

Plasmid construction 

An initiation codon and cloning sites were attached by PGR to mouse Gaipi ,4G!cNAca2,6-Sialyltransferase cDNA 
(Hamamoto, T. et aJ., Bioorg. Median. Chem., 1, 14M 45, 1993). 5-TGGCATATGGGGAGCG ACTATGAGGCTCT- 

so 3'containing an Ndel site was used as a sense primer and 5 , -ATGAGGATCCCTGGCTCAACAGCG-3 > containing a 
BamHI site as an antisense primer. The resulting PCR fragment (1 152 bp) contained the initiation codon and a region 
coding for a polypeptide from the 29th amino acid residue to the C-terminal end of the enzyme, and lacked the cytosoiic 
and transmembrane domains. The fragment was incorporated into expression vector pET3b (Studier, F.W. et al.. 
Method. Enzymol., 185, 60-89, 1990) at the Ndel-BamHI site (located downstream of the T7 promoter). The resulting 

55 recombinant vector was named as pET3-MBS. The nucleotide sequence of the PCR fragment is shown as the SEQ ID 
No.4 in the sequence listings. 



4C 



EP0737745A1 



Enzyme expression 

E. coli JM1 09(DE3) cells transfected with the vector pET3-MBS were cultured in 1 00 ml LB medium supplemented 
with 100 \iqM ampicfflin at 37 °C. When the optical density at 600 nm reached 0.2-0.4, production of the recombinant 

s protein was initiated with induction of T7 RNA polymerase by the addition of 2 mM IPTQ (isopropylp-D-thiogalatopyra- 
noside). The recombinant enzyme, lacking the cytosolic and the transmembrane domain, was accumulated in the form 
of insoluble inclusion bodies in the cells. The growth rate of the JM109(DE3) cells transfected with pET3-MBS was the 
same as that of the non-transfected JM 1 09(DE3) cells both on agar plates and in liquid culture. After 2 h cultivation, the 
cells were harvested (ca 1 g wet weight), suspended in 10 ml of 20 mM Tris-HCI (pH 8.0), and then treated with lys- 

10 ozyme (0.1 mg/ml) and DNase I (0.01 mg/ml) for 30 min. Triton X-100 was added to a final concentration of 1%. and 
insoluble fraction was collected by cerrtrrfugation at 12,000 x g for 15 min at 4 °C. The precipitate was suspended in 3 
ml of 10 mM Tris-HCI (pH 7.4) and stored at -30°C before usa 

Solubilization and renaturation 

•15 

To 0.5 ml of the above suspension, 0.48 g solid urea, 60 pJ of 5 M Nad, 20 nJ of 1 M Tris-HCI (pH 7.4) and water 
were added to final volume of 1 ml (final concentration: 8 M urea, 0.3 M Nad; 20 mM Tris-HCI, pH 7.4). The precipitate 
was extracted for 30 min at 10°C, followed by centrifugation at 12,000 x g for 15 min. Most of the extracted protein had 
the molecular mass of 42k dalton. Where 5.7 M urea buffer was used for the extraction, 80% of the enzyme was recov- 
20 ered. 

The 0.1 ml aliquots of extract containing 8 M urea were diluted with each 1.9 ml of a renaturation composition 
(standard composition: 2 M urea, 0.5 M NaCI, 10 mM lactose, 0.5 mM EDTA, and 20 mM MOPS-NaOH, pH 7.0) to a 
final protein concentration of about 0.02 mg/ml. The solution was left at 40°C for 12 h, and then diluted again with an 
equal volume of the renaturation composition, thereby reducing the urea concentration to half (approximately 1.2 M), 
25 and then the mixture was left at 40 °C for additional 48 h. Then, sialyltransferase activity was measured to analyze the 
effects of the components of the renaturation composition at this point (Table 3). The resulting enzymes were further 
dialyzed against the renaturation composition to remove residual urea and the reducing agents over 48 h at 4 °C. The 
samples were concentrated approximately 20 times with Centricon-30 (Amicon). 

30 Sialyltransferase assay 

The activity of the sialyltransferase was measured with 50 fiM CMP-[ 14 C]NeuAc (0.9 Bq/pmole) as a donor sub- 
strate, and 5 mM Gaipi ,4GlcNAc (N-acetyOactosamine) as an acceptor substrata Reaction mixture was added with 1 
mg/ml bovine serum albumin, 1 of the enzyme solution, and 50 mM sodium cacodylate (pH 6.0) to a total volume of 

35 10 and inciibation was continued at 37 °C for 1 h. Then, the samples were appOed to silica ge!60 HPTLC plate 
(Merck Germany) and developed with ettmnol/jpyridine/n-butanot/acetic acid/water (100:10:10:3:30) as a developing 
solvent The radioactivity transferred on each plate was determined with a radio image analyzer BAS2000 (Fuji Photo 
Rim, Japan, Lee, Y.-C. et al., Eur. J. Biochem., 216, 377-385, 1993). One unit of enzymatic activity was defined as an 
amount catalyzing 1 nmole of sialic add transfer per minute. The acceptor preference as to oligosaccharide branches 

40 was examined using a N -acetyl lactosarrrin e type biantennary pyridylamino-ofigosaccharide as an acceptor substrate 
and analyzed f luorophotometrically by HPLC. 

When tiie 8 M urea extract was dialyzed without dilution at 4 °C, almost no activity of the enzyme precipitated at 
concentration of less than 0.5 M was recovered. The results of the optimum dilution conditions at 48 h after the second 
dilution are shown in Table 3 set out below. In the table, the standard renaturation composition was comprised of: 2 M 

45 urea, 20 mM Tris-HCI, 0.3 M NaCI, 20 mM lactose, and 0.5 mM EDTA (pH 7.4), and as to other compositions, deviations 
from the standard composition are indicated. 



so 
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Table 3 



5 


The effects of various conditions on renaturation of 
Galp1,4QalNAca2,6-sialyttransferase 


Renaturation conditions 


Relative activity compared to 
standard 




Standard comoosltion 


1 


10 


dH 9 5 Tris-HCI 20 mM 


0* 




nH8 0 Tris-HCI 20 mM 


06 




dH 7 0 MOPS-NaOH 20 mM 


2.5 


15 


pH 6.0, MES-NaOH 20 mM 


1.5 




0.5MNaa 


2 




0.1 M NaCI 


0.2 




0.01 M Nad 


0 


20 


0 mM lactose 


0.5 




1 M urea 


1.5 




0 M urea 


0.6 | 



* A value of 0 indicates less than 5% of the control 



The maximum renaturation was observed with 0.5 M NaCI (pH 7.0) in the standard composition, and these com- 
so positions were used in further experiments. After three independent renaturation experiments were carried out under 
this condition, total recovered activities were 0.4-0.8 mU/0.1 ml extract The enzymes at this stage of renaturation 
showed high Km values for CMP-NeuAc and N acetytlactosamine, 0.14 mM and 20 mM, respectively. Under the condi- 
tions tested, reducing agents (DTT and p-mercaptoethanol) inhibited the enzyme activity, which may due to carryover 
of urea at the concentration of 0.1 M in the assay mixture. In addition, very little activity was observed at 12 h after the 
35 second dilution, which apparently indicates that a refolding process of the polypeptide is very slow at the test tempera- 
tura Almost the same activity, as that in the process without the use of the reducing reagents, was obtained by the fol- 
lowing process: the 8 M urea extract was diluted with 20 volumes of the renaturation composition containing 2 M urea, 
20 mM MOPS-NaOH, pH 7.0, 0.5 M NaCI, 20 mM lactose, and 0.5 mM EDTA in the presence of 1 uM or 1 mM reducing 
regents, and then samples were left at 4 °C for 12 h and diluted to reduce the urea concentration to half, and the resid- 
40 ual urea and reducing reagents were removed by dialysis. The results are shown in Table 4. 



Table 4 


Reducing 


Specific 


regent 


activity (mU/mg) 


None 


7 


1 uM DTT 


6 


1 mM DTT 


12 



The substrate specif ictty of renatured mouse Gaipi,4G!cNAca2,6-sia!yltransferase was assayed using each 2 
mg/ml of substrates. The products were analyzed by HPTLC. HPTLC was performed using ethanol/pyridine/n-buta- 
55 nol/acetic acid/water (1 00:1 0:1 03:30) as a developing solvent when oligosaccharides and glycoproteins were used as 
acceptors, and chloroform :meihanol:0.5% CaCI 2 (55:45:8) as a developing solvent when glycofipids were used as 
acceptors. The substrate specificity and kinetic parameters of the renatured enzymes were similar to those of the 
enzyme obtained from rat liver. 
The results are shown in Table 5 and Table 6. 
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Table 5 





Substrate 


Relative Activity to Gaipi ,4G)cNAc 


5 




Renatured mouse 
Gaipi,4GlcNAca2,6-sia- 
lyl transferase 


Rat liver 
Galpl ,4GIcN Acoa2,6-sia- 
lyftransf erase 




Fetmn 


0.25 


0* 


10 


_ MSIaiUicIUin 


1.5 


0.97 




a1 acid glycoprotein 


0.1 


0.1 




Asiaio-ai acta glycoprotein 


2.1 


1 


15 


Bovine submaxillary mucin 


0 


0 




Bovine submaxillary asialo-mudn 


0 


0 




Lacto N-tetraose 




0 




GaI81 dGlcNAc 


1 


1 


20 


Qaipi,3GicNAc 


0 


0 




GaJNAcp1.4GaJ 


0 


0 




Gaipi,4G!c 


0 


0 


25 


Gal 


0 


0 



* A value of 0 indicates less than 2% of the control 



30 



Table 6 



Substrate 


Km(mM) 




Renatured mouse 
Gaipi,4GlcNAca2.6-sia- 
lyttransferase 


Rat liver 
Galpl ,4GlcNAcaa2,6-sia- 
lyltransf erase 


CMP-NeuAc' 


0.08 


0.04 


N-acetyilactosamine 


6.5 


5 


Asialo-orosomucoid" 


0.4 


0.2 



• Measured with N-acetyilactosamine as the acceptor. 
Concentration expressed as terminal galactose residues. 



GaJpi > 4GlcNAca2 t 6-sialyttransf erase is capable of recognizing the different branches of biantennary glycopeptides 
of the N-acetyflactosamine type (Joziasse, D.HL et al., J. Biol. Chem., 260, 714-719, 1985; and Van den Eijnden D.H. et 

50 al., Btochem. Biophys. Res. Comm., 92, 839-845, 1 980). A desialylated biantennary PA-oligosacchartde was sialylated 
by the enzyme renatured according to the method of the present invention and then analyzed with HPLC. The assays 
were performed using 10 pmoles of acceptor substrates and 0.1 mM CMP-NeuAc in a final volume of 5 uJ. The reaction 
mixtures were incubated at 37»C for 1 h, and the reaction was stopped by the addition of 90 uJ of cold water. To identify 
sialylated pyridylamino oligosaccharides, each reaction mixture was subjected to HPLC analyses equipped with a 

55 « reversed-phase column (Shimpack CLOODS, 0.6 cm x 15 an. Shimazu, Japan). The column was equilibrated with 
mixture of 70% solvent A (10 mM sodium phosphate, pH 3.8) and 30% solvent B (0.5% n-butanol, 1 0 mM sodium phos- 
phate, pH 3.8), and eluted at the flow rate of 1 ml/min with a linear gradient of solvent B to 60% over 30 min at 55°C. 
Pyridylamino oligosaccharides were detected f iuorophotometrically (excitation at 320 nm and emission at 400 nm), and 
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the results indicated that the renatured enzyme showed higher preference for galactose residues on Manal ,3 branches 
rather than for galactose residues on Manal ,6 brandies like the native enzyme. 

By competely remove urea, the renatured enzyme restored its resistance to reducing agents. In addition, more than 
10 times activation was recovered by renaturing with the addition of divalent cations. White not bound by any specific 
.theory, where dialysis is carried out for a prolonged period of time against a dialysis buffer containing 0.5 mM EDTA in 
the presence of urea, divalent cations, which are tightly bound to the enzyme to maintain the proper conformation of the 
enzyme, may be lost Where the enzyme was renatured in the renaturation composition containing 1 .2 M urea, the addi- 
tion of divalent cations increased the activity. The results obtained are shown in Table 7. In the table, the activities are 
shown as relative values to that obtained by no addition of reagents. The specific activity of the renatured enzyme was 
0.15 U/mg protein when measured with 5 mM MnCfe. which is about 2% of thai of the enzyme obtained from rat liver 
(weinstein, J. et al., J. Biol. Chem., 257, pp. 13835-1 3844, 1982). The overall recovery of the enzyme was 0.1 U/10Q ml 
culture medium. 



Table 7 



Reagent 


Renatured mouse 
sialyttransf erase 


Rat liver 

Oolpl .'HalCINnGOGi.O- 

sialyltransferase 


Reducing agent 






DTT (1 mM) 


1.0 


0.9 




1.1 


1.2 


Mercaptoethanol (1 mM) 


1.1 


1.1 


OHM) . 


1.0 


1.1 


Detergent 






Triton x-100(1%) 


1.5 


0.8 


(0.5%) 


1.4 


1.4 


(0.1%) 


1.3 


1.3 


Divalent cations 






MgCfefSmM) 


11 


1.0 


MnCfe (5 mM) 


13 


1.1 


EDTA (5 mM) 


1.7 


0.9 



The method of the present invention was specifically explained above referring to the examples relating to the 
Gaipi,4GcNAcoe2,6-sialyltransferase. However, the method of the present invention is not limited to these examples. 
As described above, unlike other glycosyltransferases, sialyttransf erases share highly conserved regions (sialylmotif, 
Livingston, B.D. and Paulson, J.C., J. Biol. Chem.,268, 11504-11507, 1993), and all of the sialyltransferases are con- 
sidered to have similar higher-order structures (Drickamer, K, Glycobiology, 3, 2-3, 1993). Therefore, it is readily under- 
stood by those skilled in the art that the renaturation procedure disclosed in the above examples for 
Galpl ,4GlcNAca2,6-sialyltransferase can be applied to renaturations of other sialyltransferases to achieve the same 
advantageous effects. Furthermore, those skilled in the art will be able to choose optimum renaturing conditions, not 
only for Gaipi,4GlcNAca2,6-sialyftransferase but for other sialyltransferases, by modifying or altering the processes 
disclosed in the specification. 

The regents and samples used in the above example (D) were as follows. Rat liver Qaipi ,4GlcNAca2,6-sialyltrans- 
ferase, fetuin, asialo-fetuin, bovine submaxillary mucin, crt-acid glycoprotein, galactose fH,3-N-acetylga!actosamine, 
tacto N-tetraose and N-acetyllactosamine were obtained from Sigma (St Louis, USA). Urea was purchased from Wako 
Pure Chemicals (Osaka, Japan) and a solution was prepared just before usa CMP-{ 14 C]NeuAc (1 1 GBqfmmole) was 
obtained from Amersham (U.K). Bovine submaxillary asialo-mudn and asialo-al-actd glycoprotein were obtained by 
mild acid treatment of corresponding glycoproteins. N-acetylgalactosamin e p1 galactose was a kind gift from Dr. 
Kajimoto (The institute of Physical and Chemical Research, RIKEN, Wako-shi, Saitama-ken, Japan). Pyridyiamino oli- 
gosaccharides (PA-sugar 001. 021, 022 and 023) were obtained from Takara (Kyoto, Japan). Protein concentrations 
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were determined with a BCA protein assay kit (Pierce) using bovine serum albumin as the standard. Dialysis tubing 
(20/32) was from Viskase. 

Industrial applicability 

5 

The novel GalNAca2,6-sialy»transferases P-B1 and P-B3, and proteins which contain a polypeptide part as being 
the active domain of said enzymes and are released extracellularly provided by the present invention are useful as, for 
example, reagents for introducing human type sugar-chain to proteins and medicament for treating hereditary diseases 
lacking human-specific sugar chains. In addition, they can be used as drugs for inhibiting tumor metastases, preventing 
10 viral infection, and controlling inflammatory reaction. Furthermore, the method of the present invention is useful when 
a large quantity of a siaJyltransferase is expressed in microorganisms, since it enables a mass recovery of the enzyme 
with highly restored activity from aggregate or precipitate inside the cells. 
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SEQUENCE LISTING 

(2) INFORMATION OF SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2027 

(B) TYPE: nucleic acid 

(C) STRANDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

(-31) CCGAGCTTCCATCTCTCCCGGGCCTCTCACT -1 

ATG GGG TTT TTA ATC AGA AGG CTT CCT AAA GAT TCC AGA ATA TTC 45 
MET Gly Phe Leu lie Arg Arg Leu Pro Lys Asp Ser Arg lie Phe 15 

CGT TGG CTC CTT ATT TTA ACA GTC TTT TCC TTC ATC ATT ACT AGT 90 
Arg Trp Leu Leu He Leu Thr Val Phe Ser Phe He He Thr Ser 30 

TTT AGC GCC TTG TTT GGC ATG GAG AAA AGC ATT TTC AGG CAG CTC 135 
Phe Ser Ala Leu Phe Gly MET Glu Lys Ser lie Phe Arg Gin Leu 45 



40 AAG ATT TAC CAA AGC ATT GCA CAT ATG CTA CAA GTG GAC ACC CAA . 180 

Lys He Tyr Gin Ser He Ala His MET Leu Gin Val Asp Thr Gin 60 

45 GAT CAG CAA GGT TCA AAC TAT TCT GCT AAT GGG AGA ATT TCA AAG 225 

Asp Gin Gin Gly Ser Asn Tyr Ser Ala Asn Gly Arg He Ser Lys 75 

50 GTT GGT TTG GAG AGA GAC ATT GCA TGG CTC GAA CTG AAT ACT GCT 270 

Val Gly Leu Glu Arg Asp He Ala Trp Leu Glu Leu Asn Thr Ala 90 
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GTG AGT ACA CCA ACT GGG GAA GGG AAG GAA GAG CAG AAG AAA ACA 315 
Val Ser Thr Pro Ser Gly Glu Gly Lys Glu Glu Gin Lys Lys Thr 105 



10 



GTG AAA CCA GTT GCC AAG GTG GAA GAA GCC AAG GAG AAA GTG ACT 360 
Val Lys Pro Val Ala Lys Val Glu Glu Ala Lys Glu Lys Val Thr 120 



15 



GTG AAA CCA TTC CCT GAG GTG ATG GGG ATC ACA AAT ACA ACA GCA 405 
Val Lys Pro Phe Pro Glu Val MET Gly lie Thr Asn Thr Thr Ala 135 



20 



TCA ACA GCC TCT GTG GTG GAG AGA ACA AAG GAG AAA ACA ACA GCG 450 
Ser Thr Ala Ser Val Val Glu Arg Thr Lys Glu Lys Thr Thr Ala 150 



25 



AGA CCA GTT CCA GGG GTG GGG GAA GCT GAT GGG AAG AGA ACA ACG 495 
Arg Pro Val Pro Gly Val Gly Glu Ala Asp Gly Lys Arg Thr Thr 165 



30 



ATA GCA CTT CCC AGC ATG AAG GAA GAC AAA GAG AAG GCG ACT- GTG 540 
He Ala Leu Pro Ser MET Lys Glu Asp Lys Glu Lys Ala Thr Val 180 



35 



AAA CCA TCC TTT GGG ATG AAG GTA GCT CAT GCA AAC AGC ACA TCC 585 
Lys Pro Ser Phe Gly MET Lys Val Ala His Ala Asn Ser Thr Ser 195 



40 



AAA GAT AAA CCA AAG GCA GAA GAG CCT CCT GCA TCA GTG AAA GCC 630 
Lys Asp Lys Pro Lys Ala Glu Glu Pro Pro Ala Ser Val Lys Ala 210 



45 



ATA AGA CCT GTG ACT CAG GCT GCC ACA GTG ACA GAG AAG AAG AAA 675 
He Arg Pro Val Thr Gin Ala Ala Thr Val Thr Glu Lys Lys Lys 225 



so 



CTG AGG GCT GCT GAC TTC AAG ACT GAG CCA CAG TGG GAT TTT GAT 720 
Leu Arg Ala Ala Asp Phe Lys Thr Glu Pro Gin Trp Asp Phe Asp 240 
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GAT GAG TAC ATA CTG GAT AGC TCA TCT CCA GTA TCG ACC TGC TCT 765 

Asp Glu Tyr He Leu Asp Ser Ser Ser Pro Val Ser Thr Cys Ser 255 

GAA TCA GTG AGA GCC AAG GOT GCC AAG TCT GAC TGG CTG CGA GAT 810 

Glu Ser Val Arg Ala Lys Ala Ala Lys Ser Asp Trp Leu Arg Asp 270 

CTT TTC CTG CCG AAC ATC ACA CTC TTC ATA GAC AAG AGT TAC TTC 855 

Leu Phe Leu Pro Asn He Thr Leu Phe He Asp Lys Ser Tyr Phe .285 



AAT GTC AGT GAG TGG GAC CGC CTG GAG CAT TTT GCA CCT CCC TAT 900 
so Asn Val Ser Glu Trp Asp Arg Leu Glu His Phe Ala Pro Pro Tyr 300 



GGC TTC ATG GAG CTG AAT TAC TCA CTG GTA GAA GAA GTC ATG TCA * 945 

Gly Phe MET Glu Leu Asn Tyr Ser Leu Val Glu Glu Val MET Ser 315 

CGG CTG CCT CCA AAT CCC CAC CAG CAG CTG CTC CTG GCC AAC AGT 990 

Arg Leu Pro Pro Asn Pro Bis Gin Gin Leu Leu Leu Ala Asn Ser 330 

AGC AGC AAC GTG TCA ACG TGC ATC AGC TGT GCT GTT GTG GGG AAT 1035 

Ser Ser Asn Val Ser Thr Cys He Ser Cys Ala Val Val Gly Asn 345 

GGA GGG ATA TTG AAT AAC TCT GGA ATG GGC CAG GAG ATT GAC TCC 1080 

Gly Gly He Leu Asn Asn Ser Gly MET Gly Gin Glu He Asp Ser 360 



45 CAT GAC TAT GTG TTC CGG GTG AGC GGG GCT GTA ATC AAA GGT TAC 1125 

His Asp Tyr Val Phe Arg Val Ser Gly Ala Val He Lys Gly Tyr 375 

50 GAA AAG GAT GTG GGA ACA AAA ACC TCC TTC TAC GGA TTC ACA GCG 1170 

Glu Lys Asp Val Gly Thr Lys Thr Ser Phe Tyr Gly Phe Thr Ala 390 



55 
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TAC TCC CTG GTG TCC TCT CTC CAG AAC TTG GGA CAC AAA GGG TTC 1215 

Tyr Ser Leu Val Ser Ser Leu Gin Asn Leu Gly His Lys Gly Phe 405 

AAG AAG ATC CCA CAG GGG AAG CAT ATC AGA TAC ATT CAC TTC CTG 1260 

Lys Lys lie Pro Gin Gly Lys His He Arg Tyr He His Phe Leu 420 

GAG GCA GTT AGA GAC TAT GAG TGG CTG AAG GCT CTT CTG TTG GAC 1305 

Glu Ala Val Arg Asp Tyr Glu Trp Leu Lys Ala Leu Leu Leu Asp 435 



AAG GAT ATC AGG AAA GGA TTC CTG AAC TAC TAT GGG CGA AGG CCC 1350 

so Lys Asp He Arg Lys Gly Phe Leu Asn Tyr Tyr Gly Arg Arg Pro 450 

CGG GAG AGA TTC GAT GAA GAT TTC ACA ATG AAT AAG TAC CTG GTA 1395 

25 Arg Glu Arg Phe Asp Glu Asp Phe Thr MET Asn Lys Tyr Leu Val 465 



GCT CAC CCT GAT. TTC CTC AGA TAC TTG AAA AAC AGG TTC TTA AAA 1440 

Ala His Pro Asp Phe Leu Arg Tyr Leu Lys Asn Arg Phe Leu -Lys 480 

TCT AAA AAT CTG CAA AAG CCC TAC TGG CGG CTG TAC AGA CCC ACA 1485 

Ser Lys Asn Leu Gin Lys Pro Tyr Trp Arg Leu Tyr Arg Pro Thr 495 

ACA GGA GCC CTC CTG CTG CTG ACT GCC CTG CAT CTC TGT GAC CGG 1530 

Thr Gly Ala Leu Leu Leu Leu Thr Ala Leu His Leu Cys Asp Arg 510 



45 GTG AGT GCC TAT GGC TAC ATC ACA GAA GGT CAC CAG AAG TAC TCG 1575 

Val Ser Ala Tyr Gly Tyr He Thr Glu Gly His Gin Lys Tyr Ser 525 

SO GAT CAC TAC TAT GAC AAG GAG TGG AAA CGC CTG GTC TTC TAC GTT 1620 

Asp His Tyr Tyr Asp Lys Glu Trp Lys Arg Leu Val Phe Tyr Val 540 
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AAC CAT GAC TTC AAC TTG GAG AAG CAG GTG TGG AAA AGG CTT CAT 1665 

Asn His Asp Phe Asn Leu Glu Lys Gin Val Trp Lys Arg Leu His 555 

GAT GAG AAC ATC ATG AAG CTC TAC CAG AGA TCC TGA CAG TGT GCC 1710 

Asp Glu Asn lie MET Lys Leu Tyr Gin Arg Ser — 566 



GAGGGCCATT GCCTGGGAAA TCTCAACAGC ACCTCATGGG GAACAGAAGA 1760 

GGACCTCGGA AGCCAGGGTT AGCTCTGGAC TTCCAGGCCC AGCTTCAGCT 1810 

CCACAGAGAT ATTTCCCTCC TTTGATATCT TTATTTTCTC ACAACACTTC 1860 

CTAAAATGTG CATATTCTAC AGACCAAGCG AACAGTAGGG AAAAGTGCCT 1910 

CCAAACAAGG TCCCATCTGA CTTGTGGACG GTTGTAGGCT CTGGTACTGG 1960 

GAAAGAGGAA TCCGGGATGA ATCCGAATAG CAGATGTTCC AGTGCCCATT 2010 

ATCTTAATCA GGTTCTCCCT CTGCAAGGAG ATGCTCTTGG GGCTGGGGCT * 2060 

22 AGTTTTGCTC TAGGTGGGTT CTCTCTGTGA GTAGTGCTTG TTATGGAGCT 2110 

GGGTGTTTTG GGTAAGCAGT GGATAGAATG GAGACACACA CAATCCTGTC 2160 

TCAAGAGGAT GATTTGTGTC CTGGAGGTGC TGCTGTCACT CTGCTCACTG 2210 

30 CAGGCATAAG GACCCTTCCA ATGAACTCAA TCCCAATGTG ACTTTGCTGT 2260 

GACACCTCCT GGGGAGCACT GTGATGTCGG TGCCCAGCCT GCTGCCCTTG 2310 

GCCTAGTTCA CCATCAGCAC AAGGGAAGGG GAGAGCCCTC CGTAGTGCAG 2360 

35 CAGAATGCTG GACATTGTAC CTCTTGCTGT GGGTTCCCCT GGCTGCAGAC 2410 

TACGTGTAGT GAGTCTGATG AAGAAGCTGG TGCTTGGCTG TGCCAGGAGC 2460 

ATGGTGCTTC CTCTTCTACC AGGAGAAATG AGAATTCTCA ATGTCCATGG 2510 

*0 ATGGATGCTG TCTGTCCTTG CTGCTGGCTG GAGTGCCTGC CTACATTGTC 2560 

CTGAGAAAAG CACTGTTACA GCCAGTAAGC CTTTGGAGTA TTGGCCTTCT 2610 

GAGTGGGCTT TTGCAAACAA AATAAACGGC ACTGCTTTCC CCCAAGCTGA 2660 

45 AAAAAAAAAA A 2671 



(2) INFORMATION OF SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1294 
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(B) TYPE: nucleic acid 

(C) STRAND NESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

(-43) CTG CAGGGTTTTT ATTTTTAATT TTCTTTCAAA TACTTCCACC 
PstI 

ATG AAA TTC AGC TGG GTC ATG TTC TTC CTG ATG GCA GTG GTT ACA 
MET Lys Phe Ser Trp Val MET Phe Phe Leu MET Ala Val Val Thr 

GGG GTC AAT TCA GAA TTC ACT GAG CCA CAG TGG GAT TTT GAT GAT " 
Gly Val Asn Ser Glu Phe Thr Glu Pro Gin Trp Asp Phe Asp Asp 

GAG TAC ATA CTG GAT AGC TCA TCT CCA GTA TCG ACC TGC TCT GAA 
Glu Tyr lie Leu Asp Ser Ser Ser Pro Val Ser Thr Cys Ser Glu 

TCA GTG AGA GCC AAG GOT GCC AAG TCT GAC TGG CTG CGA GAT CTT 
Ser Val Arg Ala Lys Ala Ala Lys Ser Asp Trp Leu Arg Asp Leu 

TTC CTG CCG AAC ATC ACA CTC TTC ATA GAC AAG AGT TAC TTC AAT 
Phe Leu Pro Asn lie Thr Leu Phe lie Asp Lys Ser Tyr Phe Asn 

GTC AGT GAG TGG GAC CGC CTG GAG CAT TTT GCA CCT CCC TAT GGC 
Val Ser Glu Trp Asp Arg Leu Glu His Phe Ala Pro Pro Tyr Gly 
60 70 

TTC ATG GAG CTG AAT TAC TCA CTG GTA GAA GAA GTC ATG TCA CGG 
Phe MET Glu Leu Asn Tyr Ser Leu Val Glu Glu Val MET Ser Arg 
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CFG CCT CCA AAT CCC CAC CAG CAG CTG CTC CTG GCC AAC AGT AGO 
Leu Pro Pro Asn Pro His Gin Gin Leu Leu Leu Ala Asn Ser Ser 

AGC AAC GTG TCA ACG TGC ATC AGC TGT GCT GTT GTG GGG AAT GGA 
Ser Asn Val Ser Thr Cys lie Ser Cys Ala Val Val Gly Asn Gly 

GGG ATA TTG AAT AAC TCT GGA ATG GGC CAG GAG ATT GAC TCC CAT 
Gly He Leu Asn Asn Ser Gly MET Gly Gin Glu He Asp Ser His 

GAC TAT GTG TTC CGG GTG AGC GGG GCT GTA ATC AAA GGT TAC GAA 
Asp Tyr Val Phe Arg Val Ser Gly Ala Val He Lys Gly Tyr Glu 

AAG GAT GTG GGA ACA AAA ACC TCC TTC TAC GGA TTC ACA GCG TAC 
Lys Asp Val Gly Thr Lys .Thr Ser Phe Tyr Gly Phe Thr Ala Tyr 

TCC CTG GTG TCC TCT CTC CAG AAC TTG GGA CAC AAA GGG TTC 'AAG 
Ser Leu Val Ser Ser Leu Gin Asn Leu Gly His Lys Gly Phe Lys 

AAG ATC CCA CAG GGG. AAG CAT ATC AGA TAC ATT CAC TTC CTG GAG 
Lys He Pro Gin Gly Lys His He Arg Tyr He His Phe Leu Glu 

GCA GTT AGA GAC TAT GAG TGG CTG AAG GCT CTT CTG TTG GAC AAG 
Ala Val Arg Asp Tyr Glu Trp Leu Lys Ala Leu Leu Leu Asp Lys 

GAT ATC AGG AAA GGA TTC CTG AAC TAC TAT GGG CGA AGG CCC CGG 
Asp lie Arg Lys Gly Phe Leu Asn Tyr Tyr Gly Arg Arg Pro Arg 

GAG AGA TTC GAT GAA GAT TTC ACA ATG AAT AAG TAC CTG GTA GCT 
Glu Arg Phe Asp Glu Asp Phe Thr MET Asn Lys Tyr Leu Val Ala 
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30 



35 
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CAC CCT GAT TTC CTC AGA TAC TTG AAA AAC AGG TTC TTA AAA TCT 810 

His Pro Asp Phe Leu Arg Tyr Leu Lys Asn Arg Phe Leu Lys Ser 270 

AAA AAT CTG CAA AAG CCC TAC TGG CGG CTG TAC AGA CCC ACA ACA 855 

Lys Asn Leu Gin Lys Pro Tyr Trp Arg Leu Tyr Arg Pro Thr Thr 285 



GGA GCC CTC CTG CTG CTG ACT GCC CTG CAT CTC TGT GAC CGG GTG 900 
15 Gly Ala Leu Leu Leu Leu Thr Ala Leu His Leu Cys Asp Arg Val 300 



AGT GCC TAT GGC TAC ATC ACA GAA GGT CAC CAG AAG TAC TCG GAT 945 

Ser Ala Tyr Gly Tyr lie Thr Glu Gly His Gin Lys Tyr Ser Asp 315 

CAC TAC TAT GAC AAG GAG TGG AAA CGC CTG GTC TTC TAC GTT AAC 990 

His Tyr Tyr Asp Lys Glu Trp Lys Arg Leu Val Phe Tyr Val Asn 330 

CAT GAC TTC AAC TTG GAG AAG CAG GTG TGG AAA AGG CTT CAT- GAT 1035 

His Asp Phe Asn Leu Glu Lys Gin Val Trp Lys Arg Leu His Asp 345 

GAG AAC ATC ATG AAG CTC TAC CAG AGA TCC TGA CAGTGTGCCGAG 1080 

Glu Asn lie MET Lys Leu Tyr Gin Arg Ser — 360 



40 GGCCATTGCC TGGGAAATCT CAACAGCACC TCATGGGGAA CAGAAGAGGA 1175 

CCTCGGAAGC CAGGGTTAGC TCTGGACTTC CAGGCCCAGC TTCAGCTCCA 1225 

CAGAGATATT TCCCTCCTTT GATATC 1251 

4* ECORV 



(2) INFORMATION OF SBQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1666 
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(B) TYPE: nucleic acid 

(C) STRANDNESS: double 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISMS: G. ga-llus (chicken) 

(D) OTHER INFORMATION: CDS 1-1212 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

-384 -TCTTTTTGTTTGTCATC AGTGT AA 

ATAGGAAGAGCACAAAGTCATTTCTOTC^ 

TACACGTGTTGTACAGTATGCTTAAACAGTCCTTGATGAGGTCATCGCTC 

TTTTCTGTGTCATAAGAGTTTGGGTTTOCCGCGM 

CGAGTCTGAAC«GTCCGCCCAGCGCCrCCTCCTCCGGCT 

GCTCCTCGGAACATCCATCGCTCCGTCCGTCCATCCGTGCGTCCATCCCGGCT 

GCAGCGGAGCGCCGCCGCTTCGGATCCACGCGGACGGCGGACCCAACCCGGAGAGCAGCG 

ATG GGT TCC CCC CGC TGG AAG. CGT TTC TGC TTC TTG CTC CTC GCA 
MET Gly Ser Pro Arg Trp Lys Arg Phe Cys Phe Leu Leu Leu Ala 

GCC TTC ACC TCG TCC CTT CTG CTC TAC GGG CAC TAC TAC GCT ACG 
Ala Phe Thr Ser Ser Leu Leu Leu Tyr Gly His Tyr Tyr Ala Thr 

GTG GAC GTG CGC AGC GGC CCG AGG GTC GTG ACC AGC CTG CTG CAG 
Val Asp Val Arg Ser Gly Pro Arg Val Val Thr Ser Leu Leu Gin 

CCA GAG CTG CTG TTC CTG GTC CGC CCA GAC ACC CCA CAC CCA GAC 
Pro Glu Leu Leu Phe Leu Val Arg Pro Asp Thr Pro His Pro Asp 

AAC AGC CAC CAC AAG GAG CTC AGA GGG ACT GTG AAG AGC AGG GAG 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



Asn 


Ser 


His 


His 


Lys 


Glu 


Leu 


Arg 


Gly Thr Val Lys Ser Arg Glu 


75 


TTC 


TTC 


TCC 


CAA 


CCA 


TCC 


TCA 


GAG 


CTG GAG AAG CCC AAA CCC AGT 


270 


Phe 


Phe 


Ser 


Gin 


Pro 


Ser 


Ser 


Glu 


Leu Glu Lys Pro Lys Pro Ser 


90 


GGA 


AAG 


CAG 


CCC 


ACC 


CCG 


TGC 


CCC 


CGC TCG GTG GCA GCC ACG GCG 


315 


Gly 


Lys 


Gin 


Pro 


Thr 


Pro 


Cys 


Pro 


Arg Ser Val Ala Ala Thr Ala 


105 


AAG 


GCA 


GAC 


CCC 


ACG 


TTT 


GGG 


GAG 


CTC TTC CAA TTT GAC ATC CCG 


360 


Lys 


Ala 


Asp 


Pro 


Thr 


Phe 


Gly 


Glu 


Leu Phe Gin Phe Asp He Pro 


120 


GTG 


CTG 


ATG 


TGG 


G^C 


CAA 


CAC 


TTC 


AAC CCT GAG ACG TGG GAC AGG 


405 


Val. 


Leu 


Met 


Trp 


Asp 


Gin 


His 


Phe 


Asn Pro Glu Thr Trp Asp Arg 


135 


CTG 


AAG 


GCA 


CGA 


CGC 


GTC 


CCA 


TAC 


GGC TGG CAG GGT TTG TCC CAA 


450 


Leu 


Lys 


Ala 


Arg 


Arg 


Val 


Pro 


Tyr 


Gly Trp Gin Gly Leu Ser Gin 


150 


GCA 


GCT 


GTC 


GGC 


AGC 


ACC 


CTG 


CGT 


CTC CTT AAC ACC TCC TCC AAC 


495 


Ala 


Ala 


Val 


Gly 


Ser 


Thr 


Leu 


Arg 


Leu Leu Asn Thr Ser Ser Asn 


165 


ACG 


CGG 


CTC 


TTC 


GAC 


CGC 


CAC 


CTC 


TTC CCC GGG GGC TGC ATC CGC 


540 


Thr 


Arg 


Leu 


Phe 


Asp 


Arg 


His 


Leu 


Phe Pro Gly Gly Cys He Arg 


180 


TGT 


GCC 


GTG 


GTG 


GGC 


AAT 


GGG 


GGA 


ATC CTC AAC GGC TCA CGG CAG 


585 


Cys 


Ala 


Val 


Val 


Gly 


Asn 


Gly 


Gly 


He Leu Asn Gly Ser Arg Gin 


195 


GGC 


CGG 


GCC 


ATC 


GAC 


GCA 


CAT 


GAT 


TTG GTC TTC AGG CTG AAC GGG 


630 


Gly 


Arg 


Ala 


He 


Asp 


Ala 


His 


Asp 


Leu Val Phe Arg Leu Asn Gly 


210 


GCC 


ATC 


ACC 


AAA 


GGC 


TTT 


GAG 


GAG 


GAT GTT GGG AGC AAG GTT TCG 


675 


Ala 


lie 


Thr 


Lys 


Gly 


Phe 


GlU 


GlU 


Asp Val Gly Ser Lys Val Ser 


225 
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TTC 


TAC GGC 


TTC 


5 


Phe 


Tyr Gly 


Phe 




TAT 


GAG GCG 


TAT 


10 


Tyr 


Glu Ala 


Tyr 




AAG 


TAC ATC 


TTC 


15 . 


Lys 


Tyr He 


Phe 




AGG 
Arg 


TCG GCC 
Ser Ala 


ATT 
He 


25 


GGC 


GAC GAG 


CCA 




Gly 


Asp GlU 


Pro 


30 


AAG 


TTC AAG 


CTG 




Lys 


Phe Lys 


Leu 


35 


AGG 


TTC CTG 


AGG 




Arg 


Phe Leu 


Arg 


40 


TAC 


ATG CCC 


AGC 




Tyr 


Met Pro 


Ser 


45 










ACC 


TGC GAC 


CAG 




Thr 


Cys Asp 


Gin 


50 


GAG 


CAG TTC 


TCC 




GlU 


Gin Phe 


Ser 


55 









ACG GTG AAC ACC ATG AAG AAC 
Thr Val Asn Thr Met Lys Asn 

GGC TTC ACC CGG ACA CCG CAG 
Gly Phe Thr Arg Thr Pro Gin 

ATC CCC TCG GAC GCA CGC GAC 
He Pro Ser Asp Ala Arg Asp 

CAG GGC AGC CCA GTC CCC GAG 
Gin Gly Ser Pro Val Pro Glu 

CAG AAG TAT TTT GGA CTG GAG 
Gin Lys Tyr Phe Gly Leu Glu 

CTG CAT CCC GAT TTC TTG CAT 
Leu His Pro Asp Phe Leu His 

TCA GAG CTC CTG GAC ATG CAG 
Ser Glu Leu Leu Asp Met Gin 

ACT GGG GCA CTC ATG CTG CTG 
Thr Gly Ala Leu Met Leu Leu 

GTC AGT GCC TAC GGG TTC ATC 
Val Ser Ala Tyr Gly Phe He 
GAC CAT TAC TAC GAG CCA GAG 
Asp His Tyr Tyr Glu Pro Glu 



TCA CTC 


ATT GCC 


720 


Ser Leu 


He Ala 


240 


GGC AAG 


GAC CTG 


765 


Gly Lys 


Asp Leu 


255 


TAC ATC 


ATG CTG 


. 810 


Tyr He 


Met Leu 


270 


GGC TTG 


GAC AAG 


855 


Gly Leu 


Asp Lys 


285 


GCA TCT 


GCG GAG 


900 


Ala Ser 


Ala Glu 


300 


TAC CTG 


ACA' ACC 


945 


Tyr Leu 


Thr Thr 


315 


TAC GGC 


CAC CTC 


990 


Tyr Gly 


His Leu 


330 


ACA GCA 


CTG CAC 


1035 


Thr Ala 


Leu His 


345 


ACA GCC 


AAC TAC 


1080 


Thr Ala 


Asn Tyr 


360 


AAG AAG 


CCA CTG 


1125 


Lys Lys 


Pro Leu 


375 
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GTG TTC TAC GCC AAC CAC GAC ATG CTG CTG GAA GCA GAG CTG TGG 1170 
Val Phe Tyr Ala Asn His Asp Met Leu Leu Glu Ala Glu Leu Trp 390 

AGG AGT TTG CAC CGG GCG GGG ATC ATG GAG CTG TAC CAG CGG TGA 1215 
Arg Ser Leu His Arg Ala Gly lie Met Glu Leu Tyr Gin Arg 404 

GGGCAGCGCAGTCCCACTGCAAGGACTCTCAATGCMCGCAG 1275 

CTGAAGGGCTCCTTCTGTCCCTGGAGGGCTCTCCCACACTGGCGGGCC 1335 
AGGGCCTGCAGCTGACAGCAGAGCAAAGGTGGTGGTGCAGGGCGAGCCAAGGCTGGCAC^ 1395 

GAAATACTGCAACTCCTCAGGGCCCTTCAGCATCTTATTTGTGACTC^ 1455 

GGCCTTGGGGAGCCTCCGCACGTGGCIGTGAGCTCCTGATGCCATGAGAA 1515 

GTGGCAGCAGCCCCTGGGAAGCACAGTGTTCATGTGCAG^ 1575 

AGGGGATGCTGGAGAAGCATACATCTGACAGACCTCACTTCTTGGAACTT^ 163 5 

CAGCCTCGAAGTCACGCTGGGTAGGCTGCAG 1666 

(2) INFORMATION OF SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1152 

(B) TYPE: nucleic acid 
• (C) STRANDNESS: double 

(D) TOPOLOGY; linear 
(ii) MOLECULE TYPE: DNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISMS: mouse 

(D) OTHER INFORMATION: 1-1128 sialyltransf erase in soluble 

form 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

(-6) TGGCAT -1 
Ndel 
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ATG GGG AGC GAC TAT GAG GCT CTT ACA TTG CAA GCC AAG GTA TTC 45 
MET Gly Ser Asp Tyr Glu Ala Leu Thr Leu Gin Ala Lys Val Phe 15 



10 



CAG ATG CCG AAG AGC CAG GAG AAA GTG GCC GTG GGG CCT GCT CCC 90 
Gin MET Pro Lys Ser Gin Glu Lys Val Ala Val Gly Pro Ala Pro 30 



75 



CAG GCT GTG TTC TCA AAC AGC AAA CAA GAC CCT AAG GAA GGC GTT 135 
Gin Ala Val Phe Ser Asn Ser Lys Gin Asp Pro Lys Glu Gly Val 45 



20 



CAG ATC CTC AGT TAC CCC AGG GTC ACA GCC AAG GTC AAG CCA CAG 180 
Gin He Leu Ser Tyr Pro Arg Val Thr Ala Lys Val Lys Pro Gin 60 



25 



CCC TCC TTG CAG GTG TGG GAC AAG GAC TCC ACA TAC TCA AAA CTT 225 

Pro Ser Leu Gin Val Trp Asp Lys Asp Ser Thr Tyr Ser Lys Leu 75 



30 



AAC CCC AGG CTG CTG AAG ATC TGG AGG AAC TAT CTG AAC ATG AAT 270 
Asn Pro Arg Leu Leu Lys He Trp Arg Asn Tyr Leu Asn MET Asn 90 



35 



AAA TAT AAA GTG TCC TAC AAG GGG CCG GGA CCA GGA GTC AGG TTC 315 
Lys Tyr Lys Val Ser Tyr Lys Gly Pro Gly Pro Gly Val Arg Phe 105 



40 



AGC GTA GAA GGC CTG CGC TGC CAC CTT CGA GAC CAC GTG AAT GTG 360 
Ser Val Glu Gly Leu Arg Cys His Leu Arg Asp His Val Asn Val 120 



45 



TCT ATG ATA GAG GCC ACA GAT TCT CCC TTC AAC ACC ACT GAA TGG 405 
Ser MET He Glu Ala Thr Asp Ser Pro Phe Asn Thr Thr Glu Trp 135 



50 



GAG GGT TAC CTG CCC AAA GAG ACA TTC AGA ACC AAG GCT GGG CCT 450 
Glu Gly Tyr Leu Pro Lys Glu Thr Phe Arg Thr Lys Ala Gly Pro 150 
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TGC 


ACA 


AAG TGT 


GCC GTC GTG TCT TCT GCA GGA TCT CTG AAG AAC 


495 


5 


Cys 


Thr 


Lys Cys 


Ala Val Val Ser Ser Ala Gly Ser Leu Lys Asn 


165 




TCC 


CAG 


CTG GGT 


CGA GAG ATT GAT AAT CAT GAT GCG GTC CTG AGG 


540 


10 


Ser 


Gin 


Leu Gly 


Arg Glu He Asp Asn His Asp Ala Val Leu Arg 


180 




TTT 


AAT 


GGG GCA 


CCT ACA GAC AAC TTC CAA CAG GAT GTG GGC ACA 


585 


15 


Phe 


Asn 


Gly Ala 


Pro Thr Asp Asn Phe Gin Gin Asp Val Gly Thr 


195 




AAA 


ACT 


ACC ATC 


CGC CTA GTG AAC TCT CAG TTA GTC ACC ACA GAA 


630 


20 


Lys 


Thr 


Thr He 


Arg Leu Val Asn Ser Gin Leu Val Thr Thr Glu 


210 


25 


AAG 


CGC 


TTC CTG 


AAG GAC AGT TTG TAC ACC GAA GGA ATC CTG ATT 


675 


Lys 


Arg 


Phe Leu 


Lys Asp Ser Leu Tyr Thr Glu Gly He Leu He 

• • 


225 


30 


CTG 


TGG 


GAC CCA 


TCT GTG TAT CAT GCA GAC ATT CCG CAG TGG' TAT 


720 




Leu 


Trp 


Asp Pro 


Ser Val Tyr His Ala Asp He Pro Gin Trp Tyr 


240 




CAG 


AAG 


CCA GAC 


TAC AAC TTC TTC GAA ACC TAT AAG AGT TAC CGA 


765 


35 


Gin 


Lys 


Pro Asp 


Tyr Asn Phe Phe Glu Thr Tyr Lys Ser Tyr Arg 


255 




AGG 


CTT 


CAC CCC 


AGC CAG CCT TTT TAC ATC CTC AAG CCC CAG ATG 


810 


40 


Arg 


Leu 


His Pro 


Ser Gin Pro Phe Tyr He Leu Lys Pro Gin MET 


270 


45 


CCA 


TGG 


GAA CTA 


TGG GAC ATC ATT CAG GAA ATC TCT CCA GAT CTG 


855 




Pro 


Trp 


Glu Leu 


Trp Asp He He Gin Glu lie Ser Pro Asp Leu 


285 


50 


ATT 


CAG 


CCG AAT 


CCC CCA TCC TCC GGC ATG CTG GGT ATC ATC ATT 


900 




lie 


Gin 


Pro Asn 


Pro Pro Ser Ser Gly MET Leu Gly He He He 


300 
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ATG ATG ACG CTG TGT GAC CAA GTT GAT ATT TAC GAG TTC CTC CCA 945 

MET MET Thr Leu Cys Asp Gin Val Asp lie Tyr Glu Phe Leu Pro 315 

5 

TCC AAG CGC AAG ACA GAT GTG TGC TAC TAT CAC CAG AAG TTC TTT 990 

10 ser Lys Arg Lys Thr Asp Val Cys Tyr Tyr His Gin Lys Phe Phe 330 



15 



30 



GAC AGC GCC TGC ACG ATG GGT GCC TAC CAT CCG CTC CTC TTC GAG 1035 
Asp Ser Ala Cys Thr MET Gly Ala Tyr His Pro. Leu Leu Phe Glu 345 



20 AAG AAT ATG GTG AAG CAT CTC AAT GAG GGA ACA GAT GAA GAC ATT 1080 

Lys Asn MET Val Lys His Leu Asn Glu Gly Thr Asp Glu Asp He 360 

25 

TAT TTG TTT GGG AAA GCT ACC CTG TCT GGC TTC CGG AAC AAT CGC* 1125 

Tyr Leu Phe Gly Lys Ala Thr Leu Ser Gly Phe Arg Asn Asn Arg 375 



TGT TGA GCCAGGGATCCTCAT 1146 
Cys BamHI 3 76 



35 



40 Claims 

1. GalNAca2,6-siaJyltransferase P-B1 characterized by the amino acid sequence defined by SEQ ID No.1 of the 
sequence listings. 

45 2. A G^NAcc£,6~sia)yltransferase P-B1 gene encoding the amino acid sequence of GaINAca2,6-sialyftransferase P- 
B1. 

3. The GalNAca2,6-sia!yl transferase gene according to claim 2 characterized by the nucleotide sequence of from 
nucleotides 1 to 1698 defined by SEQ ID No.1 of the sequence listings. 

50 

4. GalNAca2,6-siaIyltransferase P-B3 characterized by the amino add sequence defined by SEQ ID Na3 of the 
sequence listings. 

5. A GalNAca2,6-$ialyttransferase P-B3 gene encoding the amino acid sequence of GalNAca2 f 6^ia!yttransferase P- 

55 B3. 

6. The GalNAca2,6^alyltransferase P-B3 gene aocording to claim 5 characterized by the nucleotide sequence of 
from nucleotides 1 to 1212 defined by SEQ ID No.3 of the sequence listings. . 



EP0737745A1 

7. A recombinant vector comprising a GaiNAco2 l 6-sialyttransf erase gene encoding the amino acid sequence of the 
GaINAca2,6-sialyttransferase P-B1. 

8. The recombinant vector according to daim 7 which is plasmid XCEB-3034. 

9. A recombinant vector comprising a GaiNAca2 t 6-siaJyttransf erase gene encoding the amino acid sequence of the 
GalNAca2,6-sialyttransferase P-B3. 

10. Therecombinamvedorofdaim9whichispta^ 

70 

11. A trarisformant which is being transformed with a recombinant vector comprising a GalNAca2,6-siaJyttransf erase 
gene encoding the amino add sequence of the GalNAca2,6-sialyltransf erase P-B1. 

12. A transformant which is being transformed with a recombinant vector comprising a GaJNAca2,6-siaiyftransf erase 
is gene encoding the amino acid sequence of the GaINAca2,6-sialyftransferase P-B3. 

1 3. An extraceltularly releasable protein catalyzing GaIN Aco2 ,6-sialic acid transfer which comprises a polypeptide por- 
tion as being an active domain of the GaINAca2,6-sialyttransferase P-B1 or P-B3 together with a signal peptide. 

so 14. The protein according to daim 13, wherein the active domain of the GalNAca2,6-sialyforansferase P-B1 is the 
polypeptide characterized by amino acids of from 233 to 566 defined by SEQ ID No.1 of the sequence listings. 

15. The protein according to claim 13 which is the protein SB-690 charaderized by the amino add sequence defined 
by SEQ ID Na2 of the sequence listings. 

25 

16. A gene encoding an extracellularly releasable protein catalyzing GaiNAca2 t 6-sialic add transfer which comprises 
a polypeptide portion as being an active domain of the GalNAca2,6-siaJyttransferase P-B1 or P-B3 together with a 
signal peptide, 

30 17. The gene according to daim 16 characterized by the nudeotide sequence of from nucleotides 1 to 1065 defined by 
SEQ ID No.2 of the sequence listings. 

18. A recombinant vector comprising a gene encoding an extracellularly releasable protein catalyzing GalNAca2,6- 
sialic acid transfer which comprises a polypeptide portion as being an active domain of the GalNAca2,6-sialyltrans- 

$5 ferase P-B1 or P-B3 together with a signal peptide. 

19. The recombinant vedor according to daim 18 which is plasmid pcDSB-690. 

20. A transformant which is being transformed with a recombinant vector comprising a gene encoding an extracellularly 
40 releasable protein catalyzing GalNAca2 I 6-siaIic add transfer which comprises a polypeptide portion as being an 

active domain of the GalNAca2,6-sialyttransferase P-B1 or P-B3 together with a signal peptide. 

21. A process for preparing an extracellularly releasable protein catalyzing GaIN Aco2,6-sia!ic add transfer which com- 
prises a polypeptide portion as being an active domain of the Ga!NAca2,6-sia)yttransf erase P-B1 or P-B3 together 

45 with a signal peptide, which comprises the steps of culturing a transformant which is being transformed with a 
recombinant vector comprising a gene encoding an extracellularty releasable protein catalyzing GalNAca2,6-sialic 
add transfer which comprises a polypeptide portion as being an active domain of the Ga1NAca2,6-sia!y1 transferase 
P-B1 or P-B3 together with a signal peptide, and recovering said protein from the culture. 

so 22. GaJNAca2,6-aaly1transferase. 

23. GalNAca2,6-siafy [transferase derived from mammals. 

24. A process for preparing a sialyttransferase which comprises the steps of: 

55 

(a) expressing a sialyttransferase in a microorganism; 

(b) extracting the sialyttransferase using 5 to 9 M urea from proteinic aggregate or precipitate that contains the 
enzyme and is accumulated inside the microorganism; 
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(c) diluting the extract obtained by the step (b) with a renaturation composition to obtain a primary dilution con- 
taining 1 to 4 M urea; 

(d) diluting the primary dilution obtained by the step (c) with a renaturation composition to obtain a secondary 
dilution containing 0.5 to 2 M urea; and 

(e) removing urea from the secondary dilution obtained by the step (d) by dialysis to afford a renatured sialyl- 
transferasa 

25. The process according to claim 24, wherein the renaturation composition used in the step (c) comprises 1 to 2 M 
urea, 20 mM MOPS-NaOH, 0.5M NaCI, 20 mM lactose, and 0.5 mM EDTA (pH 7.0); and the renaturation compo- 
sition used in the step (d) comprises 20 mM MOPS-NaOH, 0.5 M NaCI, 20 mM lactose, and 0.5 mM EDTA (pH 7.0). 

26. The process according to claim 24, wherein the sialyltransferase is Ga!NAca2,6-sialyltransferase or 
Galp1 i 4GlcNAco2,6-sialyltransferase. 

27. A process for preparing a sialyltransferase which comprises the steps of. 

(a) expressing a sialyltransferase in a microorganism; 

(b) extracting the sialyltransferase using 8 M urea from proteinic aggregate or precipitate that contains the 
enzyme and is accumulated inside the microorganism; 

(c) diluting the extract obtained by the step (b) with a renaturation composition, followed by standing thedBution 
for at least 12 hours at 4 °C to obtain a primary dilution containing 2 to 3 M urea; 

(d) diluting the primary dilution obtained by the step (c) with a renaturation composition, followed by standing 
the dilution for at least 48 hours to obtain a secondary dilution containing 1 to 2 M urea; and 

(e) removing urea from the secondary dilution obtained by the step (d) by dialysis in the presence of one or 
more divalent cations to afford a renatured sialyttransferasa 

28. The process according to claim 27, wherein the renaturation composition used in the step (c) comprises 1 to 2 M 
urea, 20 mM MOPS-NaOH, 0.5 M NaCI, 20 mM lactose, and 0.5 mM EDTA (pH 7.0); and the renaturation compo- 
sition used in the step (d) comprises 20 mM MOPS-NaOH, 0.5M NaCI, 20 mM lactose, and 0.5 mM EDTA (pH 7.0). 

29. A process according to claim 27, wherein the sialyltransferase is GalNAca2,6-sialyttransferase or 
Gaipi ,4GdcNAca2 l 6-siaIyftransferasa 
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